D INTELLECTUAL PROPERTY ORGANIZATION' 
International Bureau 



0 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C07D 207/34, 233/90, A61K 31/415, 
C07D 403/14, C12Q 1/68 



Al 



(11) International Publication Number: WO 98/37066 

(43) International Publication Date: 27 August 1998 (27.08.98) 



(21) International Application Number: 



PCT/US98/01006 



(22) International Filing Date: 



21 January 1998 (21.01.98) 



(30) Priority Data: 

PCT7US97/03332 
(34) Countries for 
international 
60/043,444 
60/042,022 
08/837,524 
08/853,522 
PCT/US97/ 12722 
(34) Countries for 
international 



20 February 1997 (20.02.97) WO 
which the regional or 

application was filed: US et al. 

8 April 1997 (08.04.97) US 

16 April 1997 (16.04.97) US 

21 April 1997 (21.04.97) US 
8 May 1997 (08.05.97) US 
21 July 1997 (21.07.97) WO 

which the regional or 

application was filed: US et al. 



US 

Filed on 



60/043,444 (CIP) 
8 April 1997 (08.04.97) 



(63) Related by Continuation (CON) or Continuation-in-Part 
(CIP) to Earlier Applications 

US 08/853,522 (CIP) 

Filed on 8 May 1997 (08.05.97) 

US 08/837,524 (CIP) 

Filed on 21 April 1997 (21.04.97) 

US 08/607,078 (CIP) 

Filed on 26 February 1996 (26.02.96) 

US 60/042,022 (CIP) 

Filed on 16 April 1997 (16.04.97) 



(71) Applicant (for all designated States except US): CALIFORNIA 

INSTITUTE OF TECHNOLOGY [US/ US]; Pasadena, CA 
91125 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): BAIRD, Eldon, E. 
[US/US]; 255 S. Madison Avenue #5, Pasadena, CA 91 101 
(US). DERVAN, Peter, B. [US/US J; 1235 St. Albans Road, 
San Marino, CA 91108 (US). 

(74) Agent: McDONNELL, John, J.; McDonnell Boehnen Hulbert 
& Berghoff, 300 South Wacker Drive, Chigago, IL 60606 
(US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, GW, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO patent 
(GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian patent 
(AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent 
(AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 



(54) Title: IMPROVED POLYAMIDES FOR BINDING IN THE MINOR GROOVE OF DOUBLE STRANDED DNA 



(57) Abstract 

The invention encompasses improved polyamides for binding to specific nucleotide sequences in the minor groove of double 
stranded DNA The 3-hydroxy-N-methylpyrrole/N-methylpyrrole carboxamide pair specifically recognizes the T.A base pair, while the 
N-methylpyrroIe/3-hydroxy-N-methylpyrrole pair recognizes AT nucleotide pairs. Similarly, an N-methylimidizolc/N-methylpyrrole 
carboxamide pair specifically recognizes the G.C nucleotide pair, and the N-methylpyrrole/N-methylimidizole carboxamide pair recognizes 
the C.G nucleotide pair. 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


CE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


Gil 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BR 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


RF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


'IT 


Trinidad and Tobago 


BJ 


Benin 


IK 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mex ico 


uz 


Uzbekistan 


CK 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Vict Nam 


CG 


Congo 


KF 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CII 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


z\v 


Zimbabwe 


CI 


Cdte d'Woirc 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


C/cch Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






ok: 


Germany 


U 


Liechtenstein 


SD 


Sudan 






UK 


Denmark 


LK 


Sri Lanka 


SK 


Sweden 








Estonia 


LR 


Liberia 


SC. 


Singapore 







BNSDOCID: <WO 9837066A 1 _l_> 



WO 98/37066 



PCT/US98/01006 



IMPROVED POLY AMIDES FOR BINDING IN THE MINOR 
GROOVE OF DOUBLE STRANDED DNA 

5 The U.S. Government has certain rights in this invention pursuant to Grant Nos. GM 

26453, 27681 and 47530 awarded by the National Institute of Health. 

CROSS REFERENCE TO RELATED APPLICATIONS 

10 This application is a continuation-in-part of PCT/US97/03332 filed February 20, 1997, 

Serial No. 08/853,522 filed May 8, 1997 and PCT/US 97/12722 filed July 21, 1997 which are 
continuation-in-part applications of Serial No. 08/837,524, filed April 21, 1997, Serial No. 
08/607,078, filed February 26, 1996, provisional application Serial No. 60/042,022, filed April 
16, 1997 and provisional application Serial No. 60/043,444, filed April 8, 1997. 

15 

BACKGROUND OF THE INVENTION 

Field of the Invention 

20 This invention relates to polyamides which bind to predetermined sequences in the 

minor groove of double stranded DNA. 

Description of the Related Art 

25 The design of synthetic ligands that read the information stored in the DNA double helix 

has been a long standing goal of chemistry. Cell-permeable small molecules which target 
predetermined DNA sequences are useful for the regulation of gene-expression. 
Oligodeoxynucleotides that recognize the major groove of double-helical DNA via triple-helix 
formation bind to a broad range of sequences with high affinity and specificity. Although 

30 oligonucleotides and their analogs have been shown to interfere with gene expression, the triple 
helix approach is limited to purine tracks and suffers from poor cellular uptake. The 
development of pairing rules for minor groove binding polyamides derived from N- 
methylpyrrole (Py) and N-methylimtdazole (Im) amino acids provides another code to control 
sequence specificity. An Im/Py pair distinguishes G»C from OG and both of these from A«T 

35 or T«A base pairs. Wade, W.S., Mrksich, M. & Dervan, P.B. describes the design of peptides 
that bind in the minor groove of DNA at SXA^GCA^CCA/r)^ 1 sequences by a dimeric 
side-by-side motif. J, Am, Chem. Soc. 114, 8783-8794 (1992); Mrksich, M. et al describes 
antiparallel side-by-side motif for sequence specific-recognition in the minor groove of DNA by 
the designed peptide l-methylimidazole-2-carboxamidenetropsin. Proc. Natl. Acad. Sci. USA 

40 89, 7586-7590 (1992); Trauger, J.W., Baird, E. E. Dervan, P.B. describes the recognition of 
DNA by designed ligands at subnanomolar concentrations. Nature 382, 559-561 (1996). A 
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Py/Py pair specifies A«T from G«C but does not distinguish A«T from T«A. Pelton, J.G. & 
Wemmer, D.E. describes the structural characterization of a 2-1 distamycin A- 
d(CGCAAATTTGGC) complex by two-dimensional NMR. Proc. Natl Acad. Set USA 86, 
5723-5727 (1989); White, S., Baird, E. E. & Dervan, P.B. Describes the effects of the A«T/T«A 

5 degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA. 
Biochemistry 35, 6147-6152 (1996); White, S., Baird, E. E. & Dervan, P. B. describes the 
pairing rules for recognition in the minor groove of DNA by pyrrole-imidazole polyamides. 
Chem. & Biol. 4, 569-578 (1997); White, S., Baird, E. E. & Dervan, P.B. describes the 5'-3' N- 
C orientation preference for polyamide binding in the minor groove. In order to break this 

10 degeneracy, a new aromatic amino acid, 3-hydroxy-N-methylpyrrole (Hp) incorporated into a 
polyamide and paired opposite Py, has been found to discriminate A«T from T«A. The 
replacement of a single hydrogen atom on the pyrrole with a hydroxy group in a Hp/Py pair 
regulates affinity and specificity of a polyamide by an order of magnitude. Utilizing Hp 
together with Py and Im in polyamides to form four aromatic amino acid pairs (Im/Py, Py/Im, 

15 Hp/Py, and Py/Hp) provides a code to distinguish all four Watson-Crick base pairs in the minor 
groove of DNA. 

SUMMARY OF THE INVENTION 

20 The invention encompasses improved polyamides for binding to the minor groove of 

double stranded ("duplex") DNA. The polyamides are in the form of a hairpin comprising two 
groups of at least three consecutive carboxamide residues, the two groups covalently linked by 
an aliphatic amino acid residue, preferably y-aminobutyric acid or 2,4 diaminobutyric acid, the 
consecutive carboxamide residues of the first group pairing in an antiparallel manner with the 

25 consecutive carboxamide residues of the second group in the minor groove of double stranded 
DNA. The improvement relates to the inclusion of a binding pair of Hp/Py carboxamides in the 
polyamide to bind to a T»A base pair in the minor groove of double stranded DNA or Py/Hp 
carboxamide binding pair in the polyamide to bind to an A»T base pair in the minor groove of 
double stranded DNA. The improved polyamides have at least three consecutive carboxamide 

30 pairs for binding to at least three DNA base pairs in the minor groove of a duplex DNA 
sequence that has at least one A»T or T»A DNA base pair, the improvement comprising 
selecting a Hp/Py carboxamide pair to correspond to a T»A base pair in the minor groove or a 
Py/Hp carboxamide pair to bind to an A«T DNA base pair in the minor groove. Preferably the 
binding of the carboxamide pairs to the DNA base pairs modulates the expression of a gene. 

35 

In one preferred embodiment, the polyamide includes at least four consecutive 
carboxamide pairs for binding to at least four base pairs in a duplex DNA sequence. In another 
preferred embodiment, the polyamide includes at least five consecutive carboxamide pairs for 
binding to at least five base pairs in a duplex DNA sequence. In yet another preferred 
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embodiment, the polyamide includes at least six consecutive carboxamide pairs for binding to at 
least six base pairs in a duplex DNA sequence. In one preferred embodiment, the improved 
poiyamides have four carboxamide binding pairs that will distinguish A«T, T«A, OG and G«C 
base pairs in the minor groove of a duplex DNA sequence. The duplex DNA sequence can be a 
regulatory sequence, such as a promoter sequence or an enhancer sequence, or a gene sequence, 
such as a coding sequence or a non-coding sequence. Preferably, the duplex DNA sequence is a 
promoter sequence. 

The preparation and the use of poiyamides for binding in the minor groove of double 
stranded DNA are extensively described in the art. This invention is an improvement of the 
existing technology that uses 3-hydroxy-N-methylpyrrole to provide carboxamide binding pairs 
for DNA binding poiyamides. 

The invention encompasses poiyamides having y-aminobutyric acid or a substituted y- 
aminobutyric acid to form a hairpin with a member of each carboxamide pairing on each side of 
it. Preferably the substituted y-aminobutyric acid is a chiral substituted y-aminobutyric acid 
such as (R)-2,4-diaminobutyric acid. In addition, the poiyamides may contain an aliphatic 
amino acid residue, preferably a p-alanine residue, in place of a non-Hp carboxamide, The P~ 
alanine residue is represented in formulas as p. The p-alanine residue becomes a member of a 
carboxamide binding pair. The invention further includes the substitution as a p«p binding pair 
for non-Hp containing binding pair. Thus, binding pairs in addition to the Hp/Py and Py/Hp are 
Im/p, P/Im, Py/p, (J/Py, and p/p. 

The poiyamides of the invention can have additional moieties attached covalently to the 
polyamide. Preferably the additional moieties are attached as substituents at the amino terminus 
of the polyamide, the carboxy terminus of the polyamide, or at a chiral (R)-2,4-diaminobutyric 
acid residue. Suitable additional moieties include a detectable labeling group such as a dye, 
biotin or a hapten. Other suitable additional moieties are DNA reactive moieties that provide 
for sequence specific cleavage of the duplex DNA. 

Brief Description of the Drawings 

Figure 1 illustrates the structure of polyamide lj 2< and 3. 

Figure 2 illustrates the pairing of poiyamides to DNA base pairs. 

Figure 3 illustrates the DNase footprint titration of compounds 2 and 3. 

Figure 4 illustrates a list of the structures of representative Hp containing poiyamides. 

Figure 5 illustrates the synthesis of a protected Hp monomer for solid phase synthesis. 

Figure 6 illustrates the solid phase synthesis of polyamide 2. 

Figure 7 illustrates the 1H-NMR characterization of polyamide 2. 
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Figure 8 illustrates the Mass spectral characterization of polyamide 2. 

Figure,9 illustrates 1H-NMR characterization of synthesis purity. 

Figure 10 illustrates DNasel footprint titration experiment. 

Figure 1 1 illustrates the synthesis of bifunctional conjugate of polyamide 2. 
5 Figure 12 illustrates affinity cleaving evidence for oriented hairpin formation. 

Figure 13 illustrates increased sequence specificity of Hp/Py containing polyamides. 

Figure 14 illustrates 8-ring hairpin polyamides which target S'-WGTNNW-S' sites. 

Figure 15 illustrates 8-ring hairpin polyamides which target 5'-WGANNW-3' sites. 

Figure 16 illustrates 8-ring hairpin polyamides which target 5'-WGGNNW-3' sites. 
10 Figure 17 illustrates 8-ring hairpin polyamides which target S'-WGCNNWO' sites. 

DETAILED DESCRIPTION OF THE INVENTION 

Within this application, unless otherwise stated, definitions of the terms and illustration 
15 of the techniques of this application may be found in any of several well-known references such 
as: Sambrook, J., et ai, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press (1989); Goeddel, D., ed., Gene Expression Technology*, Methods in 
Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in 
Deutshcer, M.P., ed., Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, et 
20 al., PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, CA 
(1990); Freshney, R.I., Culture of Animal Cells: A Manual of Basic Technique, 2 nd Ed., Alan 
Liss, Inc. New York, NY (1987); Murray, E.J., ed., Gene Transfer and Expression Protocols, 
pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI f Oxford University 
Press, New York (1997). 

25 

For the purposes of this application, a promoter is a regulatory sequence of DNA that is 
involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a 
segment of DNA involved in producing a peptide, polypeptide or protein, including the coding 
region, non-coding regions preceding ("leader ') and following ("trailer") the coding region, as 

30 well as intervening non-coding sequences ("introns") between individual coding segments 
f'exons"). Coding refers to the representation of amino acids, start and stop signals in a three 
base "triplet" code. Promoters are often upstream (" '5 to") the transcription initiation site of 
the corresponding gene. Other regulatory sequences of DNA in addition to promoters are 
known, including sequences involved with the binding of transcription factors, including 

35 response elements that are the DNA sequences bound by inducible factors. Enhancers comprise 

4 
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yet another group of regulatory sequences of DNA that can increase the utilization of 
promoters, and can function in either orientation (5'-3' or 3'-5') and in any location (upstream 
or downstream) relative to the promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an endogeneous ligand (e.g. a transcription factor) to the regulatory 
sequence increases transcription, thereby resulting in increased expression of the corresponding 
target gene. In such a case, interference with transcription by binding a polyamide to a 
regulatory sequence would reduce or abolish expression of a gene. 

The promoter may also include or be adjacent to a regulatory sequence known in the art 
as a silencer. A silencer sequence generally has a negative regulatory effect on expression of 
the gene. In such a case, expression of a gene may be increased directly by using a polyamide 
to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a 
polyamide to block transcription of a factor to a silencer regulatory sequence. 

It is to be understood that the polyamides of this invention bind to double stranded DNA 
in a sequence specific manner. The function of a segment of DNA of a given sequence, such as 
S'-TATAAAO 1 , depends on its position relative to other functional regions in the DNA 
sequence. In this case, if the sequence 5'-TATAAA-3' on the coding strand of DNA is 
positioned about 30 base pairs upstream of the transcription start site, the sequence forms part 
of the promoter region (Lewin, Genes VI pp. 831-835). On the other hand, if the sequence 5*- 
TATAAA-3' is downstream of the transcription start site in a coding region and in proper 
register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues 
(Lewin, Genes VI, pp. 213-215). 

While not being held to one hypothesis, it is believed that the binding of the polyamides 
of this invention modulate gene expression by altering the binding of DNA binding proteins, 
such as RNA polymerase, transcription factors, TBF, TFIIIB and other proteins. The effect on 
gene expression of polyamide binding to a segment of double stranded DNA is believed to be 
related to the function, e.g., promoter, of that segment of DNA. 

It is to be understood by one skilled in the art that the improved polyamides of the 
present invention may bind to any of the above-described DNA sequences or any other 
sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 



5 
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5,578,444 describes numerous promoter targeting sequences from which base pair sequences 
for targeting an improved polyamide of the present invention may be identified. 

It is generally understood by those skilled in the art that the basic structure of DNA in a 
5 living cell includes both major and a minor groove. For the purposes of describing the present 
invention, the minor groove is the narrow groove of DNA as illustrated in common molecular 
biology references such as Lewin, B., Genes VI, Oxford University Press, New York (1997). 

To affect gene expression in a cell, which may include causing an increase or a decrease 
10 in gene expression, a effective quantity of one or more polyamide is contacted with the cell and 
internalized by the cell. The cell may be contacted in vivo or in vitro. Effective extracellular 
concentrations of polyamides that can modulate gene expression range from about 10 
nanomolar to about 1 micromolar. Gottesfeld, J.M., et al. 9 Nature 387 202-205 (1997). To 
determine effective amounts and concentrations of polyamides in vitro, a suitable number of 
15 cells is plated on tissue culture plates and various quantities of one or more polyamide are 
added to separate wells. Gene expression following exposure to a polyamide can be monitored 
in the cells or medium by detecting the amount of the protein gene product present as 
determined by various techniques utilizing specific antibodies, including ELISA and western 
blot. Alternatively, gene expression following exposure to a polyamide can be monitored by 
20 detecting the amount of messenger RNA present as determined by various techniques, including 
northern blot and RT-PCR. 

Similarly, to determine effective amounts and concentrations of polyamides for in vivo 
administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal 

25 fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is 
analyzed. Gene expression following exposure to a polyamide can be monitored by detecting 
the amount of the protein gene product present as determined by various techniques utilizing 
specific antibodies, including ELISA and western blot. Alternatively, gene expression 
following exposure to a polyamide can be monitored by the detecting the amount of messenger 

30 RNA present as determined by various techniques, including northern blot and RT-PCR. 

The polyamides of this invention may be formulated into diagnostic and therapeutic 
compositions for in vivo or in vitro use. Representative methods of formulation may be found 

6 
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in Remington: The Science and Practice of Pharmacy, 19th ed., Mack Publishing Co., Easton, 
PA (1995). 

For in vivo use, the polyamides may be incorporated into a physiologically acceptable 
5 pharmaceutical composition that is administered to a patient in need of treatment or an animal 
for medical or research purposes. The polyamide composition comprises pharmaceutically 
acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in 
solid, liquid, gel, or aerosol form. The polyamide composition of the present invention may be 
administered in various dosage forms orally, parentally, by inhalation spray, rectally, or 
10 topically. The term parenteral as used herein includes, subcutaneous, intravenous, 
intramuscular, intrasternal, infusion techniques or intraperitoneally. 

The selection of the precise concentration, composition, and delivery regimen is 
influenced by, inter alia, the specific pharmacological properties of the particular selected 
15 compound, the intended use, the nature and severity of the condition being treated or diagnosed, 
the age, weight, gender, physical condition and mental acuity of the intended recipient as well 
as the route of administration. Such considerations are within the purview of the skilled artisan. 
Thus, the dosage regimen may vary widely, but can be determined routinely using standard 
methods. 

20 

Polyamides of the present invention are also useful for detecting the presence of double 
stranded DNA of a specific sequence for diagnostic or preparative purposes. The sample 
containing the double stranded DNA can be contacted by polyamide linked to a solid substrate, 
thereby isolating DNA comprising a desired sequence. Alternatively, polyamides linked to a 
25 suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be 
contacted by a sample containing double stranded DNA. 

The design of biflinctional sequence specific DNA binding molecules requires the 
integration of two separate entities: recognition and functional activity. Polyamides that 
30 specifically bind with subnanomolar affinity to the minor groove of a predetermined sequence 
of double stranded DNA are linked to a functional molecule, providing the corresponding 
biflinctional conjugates useful in molecular biology, genomic sequencing, and human medicine. 
Polyamides of this invention can be conjugated to a variety of functional molecules, which can 
be independently chosen from but is not limited to arylboronic acids, biotins, polyhistidines 

7 
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comprised from about 2 to 8 amino acids, haptens to which an antibody binds, solid phase 
supports, oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, bromoacetamide, 
iodoacetamide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, texas red, 
anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, psoralen, ethyl red, 

5 4-(psoraen-8-yloxy)-butyrate, tartaric acid, (+)-a-tocopheral, psoralen, EDTA, methidium, 
acridine, Ni(II)«Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles. Such 
bifiinctional polyamides are useful for DNA affinity capture, covalent DNA modification, 
oxidative DNA cleavage, DNA photocleavage. Such bifunctional polyamides are useful for 
DNA detection by providing a polyamide linked to a detectable label. Detailed instructions for 

10 synthesis of such bifunctional polyamides can be found in copending U.S. provisional 
application 60/043,444, the teachings of which are incorporated by reference. 

DNA complexed to a labeled polyamide can then be determined using the appropriate 
detection system as is well known to one skilled in the art. For example, DNA associated with 
15 a polyamide linked to biotin can be detected by a streptavidin / alkaline phosphatase system. 

The present invention also describes a diagnostic system, preferably in kit form, for 
assaying for the presence of the double stranded DNA sequence bound by the polyamide of this 
invention in a body sample, such brain tissue, cell suspensions or tissue sections, or body fluid 
20 samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, and 
preferably the amount of the double stranded DNA sequence bound by the polyamide in the 
sample according to the diagnostic methods described herein. 

The diagnostic system includes, in an amount sufficient to perform at least one 
25 assay, a specific polyamide as a separately packaged reagent. Instructions for use of the 
packaged reagent(s) are also typically included. As used herein, the term "package" refers 
to a solid matrix or material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of 
the present invention. Thus, for example, a package can be a glass vial used to contain 
30 milligram quantities of a contemplated polyamide or it can be a microliter plate well to which 
microgram quantities of a contemplated polypamide have been operatively affixed, i.e., linked 
so as to be capable of being bound by the target DNA sequence. "Instructions for use" typically 
include a tangible expression describing the reagent concentration or at least one assay method 
parameter such as the relative amounts of reagent and sample to be admixed, maintenance time 

8 
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periods for reagent or sample admixtures, temperature, buffer conditions and the like. A 
diagnostic system of the present invention preferably also includes a detectable label and a 
detecting or indicating means capable of signaling the binding of the contemplated polyamide 
of the present invention to the target DNA sequence. As noted above, numerous detectable 
labels, such as biotin/ and detecting or indicating means, such as enzyme-linked (direct or 
indirect) streptavidin, are well known in the art. 

Figure 1 shows representative structures of polyamides. ImlmPyPy-y-ImPyPyPy-p-Dp 
(1), ImlmPyPy-Y-ImHpPyPy-P-Dp (2), and ImlmHpPy-y-ImPyPyPy-p-Dp (3). (Hp = 3- 
hydroxy-N-methylpyrrole, Im = N-methylimidazole, Py = N-methylpyrrole, p = p-alanine, y = 
y-aminobutyric acid, Dp = Dimethylaminopropylamide). Polyamides were synthesized by solid 
phase methods using Boc-protected 3-methoxypyrrole, imidazole, and pyrrole aromatic amino 
acids, cleaved from the support by aminolysis, deprotected with sodium thiophenoxide, and 
purified by reversed phase HPLC. Baird, E. E. & Dervan, P. B. describes the solid phase 
synthesis of polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soa 118, 
6141-6146 (1996); also see PCT US 97/003332. The identity and purity of the polyamides 
were verified by l H NMR, analytical HPLC, and matrix-assisted laser-desorption ionization 
time-of- flight mass spectrometry (MALDI-TOF MS-monoisotopic): 1 1223.6 (1223.6 
calculated), 2 1239.6 (1239.6 calculated); 3 1239.6 (1239.6 calculated). 

Figure 2 illustrates binding models for polyamides 1-3 in complex with 5'-TGGTCA-3* 
and 5'-TGGACA-3' (A*T and T«A in fourth position highlighted). Filled and unfilled circles 
represent imidazole and pyrrole rings respectively; circles containing an H represent 3- 
hydroxypyrrole, the curved line connecting the polyamide subunits represents y-aminobutyric 
acid, the diamond represents p-alanine, and the t- represents the positively charged 
dimethylaminopropylamide tail group. 

Figure 3 shows quantitative DNase I footprint titration experiments with polyamides 2 
and 3 on the 3' 32 P labeled 250-bp pJK6 EcoRUP\ndl restriction fragment. Lane 1, intact DNA; 
lanes 2-11 DNase I digestion products in the presence of 100, 50, 20, 10, 5, 2, 1, 0.5, 0.2, 0.1 
nM polyamide, respectively; lane 12, DNase I digestion products in the absence of polyamide; 
lane 13, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. describes an 
adenine-specific DNA chemical sequencing reaction. Methods EnzymoL 15, 7823-7830 (1987). 
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All reactions were done in a total volume of 400 jaL. A polyamide stock solution or H 2 0 was 
added to an assay buffer containing radiolabeled restriction fragment, with the final solution 
conditions of 10 mM Tris-HCl, 10 mM KC1, 10 mM MgCl 2 , 5 mM CaCl 2 , pH 7.0. Solutions 
were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting reactions. 
5 Footprinting reactions, separation of cleavage products, and data analysis were carried out as 
described. White, S., Baird, E. E. & Dervan, P. B. Effects of the A»T/T»A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147- 
6152 (1996). 

10 Figure 4 shows the structure and equilibrium dissociation constant for numerous 

compounds of the present invention. Polyamides are shown in complex with their respective 
match site. Filled and unfilled circles represent imidazole (Im) and pyrrole (Py) rings, 
respectively; circles containing an H represent 3-hydroxypyrrole (Hp), the curved line 
connecting the polyamide subunits represents y-aminobutyric acid (y), the diamond represents 

15 (i-alanine (p), and the + represents the positively charged dimethylaminopropylamide tail group 
(Dp). The equilibrium dissociation constants are the average values obtained from three DNase 
I footprint titration experiments. The standard deviation for each set is less than 15% of the 
reported number. Assays were carried out in the presence of 10 mM Tris»HCl, 10 mM KC1, 10 
mM MgCl 2 , and 5 mM CaCl 2 at pH 7.0 and 22°C. 

20 

Figure 5 shows the synthetic scheme for 3-O-methyl-N-Boc protected pyrrole-2- 
carboxylate. The hydroxypyrrole monoester can be prepared in 0.5 kg quantity using published 
procedures on enlarged scale. 

25 Figure 6 shows the solid phase synthetic scheme for ImlrnPyPy-y-ImHpPyPy-P-Dp 

starting from commercially available Boc-p-Pam-Resin: (i) 80% TFA/DCM, 0.4 M PhSH; (ii) 
Boc-Py-OBt, DIEA, DMF; (iii) 80% TFA/DCM, 0.4 M PhSH; (iv) Boc-Py-OBt, DIEA, DMF; 
(v) 80% TFA/DCM, 0.4 M PhSH; (vi) Boc-3-OMe-Py-OH, HBTU, DMF, DIEA; (vii) 80% 
TFA/DCM, 0.4 M PhSH; (viii) Boc-Im-OH, DCC, HOBt; (ix) 80% TFA/DCM, 0.4 M PhSH; 

30 (x) Boc-y-aminobutyric acid, DIEA, DMF; (xi) 80% TFA/DCM, 0.4 M PhSH; (xii) Boc-Py- 
OBt, DIEA, DMF; (xiii) 80% TFA/DCM, 0.4 M PhSH; (xiv) Boc-Py-OBt, DMF, DIEA; (xv) 
80% TFA/DCM, 0.4 M PhSH; (vxi) Boc-Im-OH, DCC, HOBt (xvii) 80% TFA/DCM, 0.4 M 
PhSH; (xviii) imida2ole-2-carboxylic acid, HBTU, DIEA; (xviv) dimethylaminopropylamine, 
55 °C, 18h. Purification by reversed phase HPLC provides ImlmPyPy-y-ImOpPyPy-p-Dp. (Op 

35 = 3-methoxypyrrole). Treatment of the 3-methyoxypyrrole polyamide with thiophenol, NaH, 
DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC purification. 



10 
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Figure 7 shows the aromatic region from 7-11 ppm for the 1H-NMR spectrum 
determined at 300 MHz for ImlmPyPy-y-ImOpPyPy-P-Dp and ImlmPyPy-y-ImHpPyPy-p-Dp. 
This region of the spectrum may be used to determine compound identity and purity. 

5 Figure 8 shows-the MALDI-TOF mass spectrum determined in positive ion mode with a 

monoisotopic detector for the polyamides for ImlmPyPy-y-ImOpPyPy-P-Dp and ImlmPyPy-y- 
ImHpPyPy-P-Dp. This spectrum may be used to determine compound identity and purity. 

Figure 9 shows the methyl group region from 3.5*4.0 ppm for the 1H-NMR spectrum 
10 determined at 300 MHz for ImPyPy-y-OpPyPy-p-Dp and ImPyPy-y-HpPyPy-p-Dp. This region 
of the spectrum may be used to directly follow the progress for conversion of 3-methoxypyrrole 
to 3-hydroxypyrrole. 

Fig. 10 shows quantitative DNase I footprint titration experiments with the polyamides 

15 ImPyPy-y-PyHpPy-P-Dp and ImHpPy-y-PyPyPy-p-Dp on the 3*- 32 P labeled 370-bp pDEHl 
EcoKUPvtdl restriction fragment. Intact lane, labeled restriction fragment no polyamide or 
DNase I added; lanes 1-10, DNase I digestion products in the presence of 10 fiM, 5 fiM, 2 |iM, 
1 jiM, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM ImPyPy-y-PyPyPy-P-Dp, respectively 
or 1 \jlM, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM, 5 nM, 2 nM, 1 nM ImHpPy-y- 

20 PyPyPy-p-Dp, respectively; DNase I lane, DNase I digestion products in the absence of 
polyamide; A lane, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. 
describes an adenine-specific DNA chemical sequencing reaction. Methods EnzymoL 15, 7823- 
7830 (1987). All reactions were done in a total volume of 40 (iL. A polyamide stock solution 
or H 2 0 was added to an assay buffer containing radiolabeled restriction fragment, with the final 

25 solution conditions of 10 mM Tris-HCl, 10 mM KCl, 10 mM MgCl 2 , 5 mM CaCl 2 , pH 7.0. 
Solutions were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting 
reactions. Footprinting reactions, separation of cleavage products, and data analysis were 
carried out as described. White, S., Baird, E. E. & Dervan, P. describe the pairing rules for 
recognition in the minor groove of DNA by pyrrole-imidazole polyamides. Chemistry & 

30 Biology 4,569-578 (1997). 

Figure 1 1 shows the synthesis of a Afunctional polyamide which incorporates the Hp/Py 
pair. Treatment of a sample of ImlmPyPy-y-ImHpPyPy-P-Pam-resin (see Figure 6) with 3,3'- 
diamino-W-methyldipropylamine, 55°C, 18 h followed by reverse phase HPLC purification 
35 provides the Op polyamide with a free primary amine group which can be coupled to an 
activated carboxylic acid derivative. Treatment with (i) EDTA-dianhydride, DMSO/NMP, 
DIEA, 55 °C; (ii) 0.1M NaOH, followed by reverse phase HPLC purification provides the Op- 
Py-Im-polyamide-EDTA conjugate. Treatment of the 3-methyoxypyrrole polyamide with 
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thiophenol, NaH, DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC 
purification. 

Figure 12 shows the determination of the binding orientation of hairpin polyamides 

5 ImImPyPy-y-ImHpPyPy-p-Dp-EDTA*Fe(II) 2-E»Fe(II) and ImlmHpPy-y-ImPyPyPy-P-Dp- 
EDTA»Fe(II) 3-E»Fe(II) by affinity cleaving footprint titration. Top and bottom left: Affinity 
cleavage experiments on a 3' 32 P labeled 250-bp pJK6 EcoKU Pvu II restriction fragment. The 
5 , -TGGACA-3' and 5'-TGGTCA-3' sites are shown on the right side of the autoradiogram. 
Top left: lane 1, adenine-specific chemical sequencing reaction; lanes 2-6, 6.5 ^M, 1.0 f*M, 

10 100 nM, 10 nM, 1 nM polyamide 2-E # Fe(II); lane 7, intact restriction fragment, no polyamide 
added. Bottom left: lane 1, A reaction; lanes 2-6, 8.5 ^iM, 1.0 |iM, 100 nM, 10 nM, 1 nM 
polyamide 3-E*Fe(II); lane 7, intact DNA. All reactions were carried out in a total volume of 
40 ^L. A stock solution of polyamide or H 2 0 was added to a solution containing 20 kcpm 
labeled restriction fragment, affording final solution conditions of 25 mM Tris- Acetate, 20 mM 

15 NaCl, 100 \xM/ bp calf thymus DNA, at pH 7.0, Solutions were allowed to equilibrate for a 
minimum of 4 h at 22°K before initiation of reactions. Affinity cleavage reactions were carried 
out as described White, S., Baird, E.E. & Dervan, P.B. Effects of the A*T/T»A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147- 
6152 (1996). Top and bottom right: Affinity cleavage patterns of 2-E»Fe(II) and 3-E*Fe(II) at 

20 100 nM bound to 5'-TGGACA-3' and 5'-TGGTCA-3\ Bar heights are proportional to the 
relative cleavage intensities at each base pair. Shaded and nonshaded circles denote imidazole 
and pyrrole carboxamides, respectively. Nonshaded diamonds represent the P-alanine moiety. 
A curved line represents the y-aminobutyric acid, and the + represents the positively charged 
dimethylaminopropylamide tail group. The boxed Fe denotes the EDTA'Fe(II) cleavage 

25 moiety. 

Figure 13 shows quantitative DNase I footprint titration experiments with the 
polyamides ImPyPyPyPy-y-ImPyPyPyPy-P-Dp and ImHpPyPyPy-y-ImHpPyPyPy-p-Dp on the 
3' 32 P labeled 252-bp pJK7 EcoRl/ Pvu II restriction fragment. For ImPyPyPyPy-y- 

30 ImPyPyPyPy-P-Dp gel (left): lane 1, DNase I digestion products in the absence of polyamide; 
lanes 2-18, DNase I digestion products in the presence of 1.0 jiM, 500 nM, 200, 100, 65, 40, 25, 
15, 10, 6.5, 4.0, 2.5, 1.5, 1.0, 0.5, 0.2, 0.1 nM polyamide; lane 19, DNase I digestion products in 
the absence of polyamide; lane 20, intact restriction fragment; lane 21, guanine-specific 
chemical sequencing reaction; lane 22, adenine-specific chemical sequencing reaction. For 

35 ImHpPyPyPy-y-ImHpPyPyPy-P-Dp gel (right): lane 1, intact DNA; lane 2, DNase I digestion 
products in the absence of polyamide; lanes 3-19, 1.0 ^M, 500 nM, 200, 100, 50, 20, 10, 5, 2, 1, 
0.5, 0.2. 0.1, 0.05, 0.01, 0.005, 0.001 nM polyamide; lane 20, DNase I digestion products in the 
absence of polyamide; lane 21, A reaction. All reactions were done in a total volume of 400 

12 
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^iL. A polyamide stock solution or H 2 0 was added to an assay buffer containing radiolabeled 
restriction fragment, with the final solution conditions of 10 mM Tris-HCl, 10 mM KC1, 10 mM 
MgCl 2t 5 mM CaCl 2 , pH 7.0. Solutions were allowed to equilibrate for 4-12 h at 22°C before 
initiation of footprinting reactions. Footprinting reactions, separation of cleavage products, and 
data analysis were carried as described. White, S., Baird, E.E. & Dervan, P.B. Effects of the 
A»T/T»A degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA. 
Biochemistry 35, 6147-6152 (1996). 

Fig. 14 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula XjX^X^y-XsXaXyXg wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X 4 /X 5 , X 3 /X 6) X 2 /X 7 , and X\/X% represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGTNNW-3', where the 5'-GTNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5'-WGTNNW-3* sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

Fig. 15 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula XiX 2 X 3 X4-y-X 5 X 6 X7X 8 wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X 4 /X 5 , X 3 /X 6 , X 2 /X 7 , and X t /X 8 represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGANNW-3\ where the S'-GANN-S' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5'-WGANNW-3' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 
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Fig. 16 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X 1 X 2 X 3 X4-y-X 5 X 6 X 7 X 8 wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 

5 diaminobutyric acid; X 4 /X 5 , X 3 /X 6 , X 2 /X 7 , and X^/Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGGNNW-3\ where the 5'-GGNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 

10 minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 S'-WGGNNWO' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

15 

Fig. 17 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula XiX 2 X 3 X4-Y-X 5 X 6 X7X 8 wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 

20 diaminobutyric acid; X 4 /X 5 , X 3 /X 6 , X 2 /X 7 , and Xj/Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGCNNW-3', where the 5'-GCNN-3* core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G T C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 

25 minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5 , -WGCNN\V-3 1 sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

30 

Four-ring polyamide subunits, covalently coupled to form eight-ring hairpin structures, 
bind specifically to 6-bp target sequences at subnanomolar concentrations. Trauger, J.W., 
Baird, E. E. & Dervan, P.B. describe the recognition of DNA by designed ligands at 
subnanomolar concentrations. Nature 382, 559-561 (1996); Swalley, S. E., Baird, E. E. & 
35 Dervan, P. B. describe the discrimination of 5'-GGGG-3\ 5*-GCGC-3\ and 5'-GGCC'3' 
sequences in the minor groove of DNA by eight-ring hairpin polyamides. /. Am. Chem. Soc. 
119, 6953-6961 (1997). The DNA-binding affinities of three eight-ring hairpin polyamides 
shown in Figure 1 as compound 1, 2, and 3 containing pairings of Im/Py, Py/Im opposite G«C, 

14 
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OG and either Py/Py, Hp/Py, or Py/Hp at a common single point opposite T*A and A«T has 
been determined. Equilibrium dissociation constants (K^) for ImlmPyPy-y-ImPyPyPy-P-Dp 1, 
ImlmPyPy-y-ImHpPyPy-P-Dp 2, ImlmHpPy-y-ImPyPyPy-p-Dp 3 of Figure 1 are shown in 
Table 1. Brenowitz, M, Senear, D. F M Shea, M. A. & Ackers, G. K. describe a quantitative 
DNase footprint titration method for studying protein-DNA interactions. Methods Enzymol. 
130, 132-181 (1986); The values were determined by quantitative DNase I footprint 
titration experiments: on a 3' 32 P-labeled 250-bp DNA fragment containing the target sites, 5'~ 
TGGACA-3' and 5 , -TGGTCA-3' which differ by a single A»T base pair in the fourth position. 
The DNase footprint gels are shown in Figure 3. 



TABLE 1 Equilibrium dissociation constants* 



Polyamidet 



5'-TCGTCA-3' 



5'-TCGACA-3' 



5 -T G G 



1 Py/Py 



C A-3' 5'-T G G 



•♦OCX •♦PK^ 

tKKX>C#^ +XKX}Or 

3'-A C C[AjG T-5' 3-A C C [TJ G T-5' 

K d = 0.077 nM AT d = 0.l5 nM 



C A-3' 



2 Py/Hp 



2.0 



5'-T G G [t1 C A-3* 5'-T G G (XI C A-3' 

+KKX>®r +>000®#^ 0.06 

3-A C c[a)G T-5* 3'-A C C [tJ G T-5' 

K d = 15 nM £ d = 0.83 nM 



5'-T G GfTlc A-3' 5'-T G G [jCI C A-3' 

3 Hp/Py +xk>oo^ +XKX>or 

3'-A C C A G T-5' 3'-A C C X G T-5' 



K d ~0.48nM 



AT H =37 nM 



77 



*The reported dissociation constants are the average values obtained from three 
DNase I footprint titration experiments. The standard deviation tor each data set is 
less than 15% of the reported number. Assays were carried out in the presence of 10 
mM Tris'HCi, 10 mM KCI, 10 mM MgCI 2 , and 5 mM CaCU at pH 7.0 and 22 °C 
tRing pairing opposite T*A and A*T in the fourth position. 
*Calculated as K d (5'-TGGAG\-3')/ K d (5'-TGGTC A-3'). 



Based on the pairing rules for polyamide-DNA complexes both of these sequences are a 
match for control polyamide 1 which places a Py/Py pairing opposite 

A»T and T»A at both sites. It was determined that in polyamide 1 (Py/Py) binds to 5 T - 
TGG1CA-3 1 and 5'-TGGACA-3' within a factor of 2 (K^ = 0.077 or 0.15 nM respectively). In 
contrast, polyamide 2 (Py/Hp) binds to 5'-TGGICA-3' and 5 , -TGGACA-3 t with dissociation 
constants which differ by a factor of 18 = 15 nM and 0.83 nM respectively). By reversing 
the pairing in polyamide 3 (Hp/Py) the dissociation constants differ again in the opposite 
direction by a factor of 77 (K D = 0.48 nM and 37 nM respectively. Control experiments 
performed on separate DNA fragments; reveal that neither a 5'-TGGGCA-3' or a 5'-TGGCCA- 
3' site is bound by polyamide 2 or 3 at concentrations < 100 nM, indicating that the Hp/Py and 
Py/Hp ring pairings do not bind opposite G»C or OG. The A»T vs. T»A discrimination is 
achieved preferably when the two neighboring base pairs are G«C and OG (GTC vs. GAC). 



15 



WO 98/37066 PCT/US98/01006 

The specificity of polyamides 2 and 3 for sites which differ by a single A«T/T»A base 
pair results from small chemical changes. Replacing the Py/Py pair in 1 with a Py/Hp pairing 
as in 2, a single substitution of C3-OH for C3-H, destabilizes interaction with S'-TGGTCAO' 
by 191-fold, a free energy difference of 3.1 kcal mol* 1 . Interaction of 2 with 5~-TGGACA-3 T is 
5 destabilized only 6-fold relative to 1, a free energy difference of 1.1 kcal mol" 1 . Similarly, 
replacing the Py/Py pair in 1 with Hp/Py as in 3 destabilizes interaction with 5'-TGGACA-3' 
by 252-fold, a free energy difference of 3.2 kcal mol* 1 . Interaction of 3 with S'TGGTCAO 1 is 
destabilized only 6-fold relative to 1, a free energy difference of 1.0 kcal mol" 1 . 

10 The polyamides of this invention provide for coded targeting of predetermined DNA 

sequences with affinity and specificity comparable to sequence-specific DNA binding proteins. 
Hp, 1m, and Py polyamides complete the minor groove recognition code using three aromatic 
amino acids which combine to form four ring pairings (Im/Py, Py/Im, Hp/Py, and Py/Hp) which 
complement the four Watson-Crick base pairs, as shown in TABLE 2. There are a possible 240 

15 four base pair sequences which contain at least 1 A»T or T^A base pair and therefore can 
advantageously use an Hp/Py, or Py/Hp carboxamide binding. Polyamides binding to any of 
these sequences can be designed in accordance with the code of TABLE 2. 

TABLE 2 Pairing code for minor groove recognition* 
Pair G*C PG T*A A«T 

Im/Py + 

Py/Im + 

Hp/Py + 

Py/Hp " - + 

* favored (+), disfavored (-) 

20 
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For certain G*C rich sequences the affinity of polyamide-DNA complexes may be 
enhanced by substitution of an Im/p pair for Im/Py at G*C and p/Im for Py/Im at OG. At A*T 
and T»A base pairs, either a Py/p, p/Py, and p/p may be used. The alternate aliphatic/aromatic 
amino acid pairing code is described in Table 3. 



TABLE 3 Aliphatic/ Aromatic substitution for ring 


pairings* 




Pair 


Substitution 


Im/Py 


Im/p 


Py/Im 


p/Im 


Hp/Py 


Py/p, p/Py, Hp/p, p/p 


Py/Hp 


Py/P, p/Py, p/Hp, p/p 



U. S. Patent 5,578,444 describes numerous promoter region targeting sequences from 
which base pair sequences for targeting a polyamide can be identified. 

PCT U.S. 97/003332 describes methods for synthesis of polyamides which are suitable 
for preparing polyamides of this invention. The use of p-alanine in place of a pyrrole amino 
acid in the synthetic methods provides aromatic/aliphatic pairing (Im/p, p/Im, Py/p, and p/Py) 
and aliphatic/aliphatic pairing (p/p) substitution. The use of y-aminobutyric acid, or a 
substituted y-aminobutyric acid such as (R)-2,4 diaminobutyric acid, provides for preferred 
hairpin turns. The following examples illustrate the synthesis of polyamides of the present 
invention. 
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Example 1 : 

PREPARATION OF A PROTECTED Hp MONOMER FOR SOLID PHASE 

SYNTHESIS. 

Distamycin and its analogs have previously been considered targets of traditional 

5 multistep synthetic chemistry. Arcamone, F., Orezzi, P. G., Barbieri, W., Nicolella, V, & Penco, 
S. describe a solution phase synthesis of distamycin Gazz. Chim. Ital 1967, 97, 1097. The 
repeating amide of distamycin is formed from an aromatic carboxylic acid and an aromatic 
amine. The aromatic acid is often unstable to decarboxylation, and the aromatic amines have 
been found to be air and light sensitive. Lown, J. W. & Krowicki, K. describe a solution phase 

10 synthesis of Distamycin J. Org. Chem. 1985, 50, 3774. The variable coupling yields, long 
reaction times (often >24 h), numerous side products, and reactive intermediates (acid chlorides 
and trichloro ketones) characteristic of the traditional solution phase coupling reactions make 
the synthesis of the aromatic carboxamides problematic. B. Merrifield describes the solid phase 
synthesis of a tetrapeptide J. Am. Chem. Soc. 1963, 85, 2149. In order to implement an efficient 

15 solid phase methodology for the synthesis of the pyrrole- imidazole polyamides, the following 
components were developed: (1) a synthesis which provides large quantities of appropriately 
protected monomer or dimer building blocks in high purity, (2) optimized protocols for forming 
an amide in high yield from a support-bound aromatic amine and an aromatic carboxylic acid, 
(3) methods for monitoring reactions on the solid support, and (4) a stable resin linkage agent 

20 that can be cleaved in high yield upon completion of the synthesis. Baird, E. E. & Dervan, P. B. 
describes the solid phase synthesis of polyamides containing imidazole and pyrrole amino 
acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also see PCT US 97/003332. In order to 
prepare polyamides which contain the 3-hydroxypyrrole monomer, a synthesis has been 
developed which allows the appropriately protected Boc-Op acid monomer to be prepared on 50 

25 g scale. *H NMR and I3 C NMR spectra were recorded on a General Electric-QE 300 NMR 
spectrometer in CD3OD or DMSO-t/g, with chemical shifts reported in parts per million relative 
to residual CHCbOD or DMSO-^5, respectively, IR spectra were recorded on a Perkin-Elmer 
FTTR spectrometer. High-resolution mass spectra were recorded using fast atom bombardment 
(FABMS) techniques at the Mass Spectrometry Laboratory at the University of California, 

30 Riverside. Reactions were executed under an inert argon atmosphere. Reagent grade chemicals 
were used as received unless otherwise noted. Still, W. C, Kahn, M. & Mitra, A. describe flash 
column chromatography J. Org. Chem. 1978, 40, 2923-2925. Flash chromatography was 
carried out using EM science Kieselgel 60 (230-400) mesh. Thin-layer chromatography was 
performed on EM Reagents silica gel plates (0.5 mm thickness). All compounds were 

35 visualized with short-wave ultraviolet light. 
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Table 4 :lntermediates for pr eparation of Boc-protected 3-methoxypyrrole 
NA ME STRUCTURE 

O 

Ethyl 4-carboxy-3-hydfoxy-l- HO /f~\ 

methylpyrrole-2-carboxylate. \ N ^\^° Sv - / 




Ethyl 4-[(Benzyloxycarbonyl)amino]-3- f ^ ^ 

hydroxy-l-methylpyrrole-2-carboxylate 




I o 



Ethyl 4-[(BerLzyloxycarbonyl)amino]-3- 
methoxy-l-methylpyrrole-2-carboxylate 



O K OMe 



Ethyl 4-[(tert-Butyloxycarbonyl)amino]-3- JV^^Y"^ /° Me 

methoxy-l-methylpyrrole-2-carboxylate ^\ u Q 



4-[(tert-Butyloxycarbonyl)amino]-3-methoxy 
-l-methylpyrrole-2-carboxylic acid 



.N N OMe 



OH 

O 



Ethyl 4- [(benzyloxycarbonyl)amino]-3-hydroxy-l -methylpyrrole-2-carboxylate E thy 1-4- 
carboxy-3-hydroxy-i-methylpyrrole-2-carboxylate (60 g, 281.7 mmol) was dissolved in 282 
mL acetonitrile. TEA (28.53 g, 282 mmol) was added, followed by diphenylphosphorylazide 
(77.61 g, 282 mmol). The mixture was refluxed for 5 hours, followed by addition of benzyl 
alcohol (270 ml) and reflux continued overnight. The solution was cooled and volitiles 
removed in vacuo. The residue was absorbed onto silca and chromatagraphed, 4:1 hexanes : 
ethyl acetate, to give a white solid (21.58 g, 24%) *H NMR (DMSO-d6) 5 8.73 (s, 1H), 8.31 (s, 
1H), 7.31 (m, 5H), 6.96 (s, 1H), 5.08 (s, 2H), 4.21 (q, 2H, J = 7.1 Hz), 3.66 (s, 3H), 1.25 (t, 3H, 
J = 7.1 Hz);MSm/e 319.163 (M+H 319.122 calcd. for C16H1SN2O5). 

Ethyl 4-[(tert-butoxycarbonyl)amino]-3-methoxy'l-methylpyrrole-2' carboxylate. Ethyl 
4-[(benzyloxycarbonyl)amino]-3-hydroxy-l-methylpyrrole-2-carboxylate (13.4 g, 42.3 mmol) 
was dissolved in 110 mL acetone. Anhydrous K2CO3 (11.67 g, 84.5 mmol) was added, 
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followed by methyiiodide (5.96 g, 42.3 mmol) and dimethylaminopyridine (0.5 g, 4.23 mmol) 
and the mixture stirred overnight. The solid K2CO3 was removed by filtration and 200 ml 
water added. Volitiles were removed in vacuo and the solution made acidic with addition of IN 
H2SO4 . The aqueous layer was extracted with diethyl ether. Organic layers were combined, 

5 washed with 10% H2SO4, dried over MgSC>4, and dried to give a white solid. The solid was 
used without further purification and dissolved in 38 ml DMF. DIEA (1 1 ml), Boc anhydride 
(9.23 g, 42.3 mmol), and 10 % Pd/C (500 mg) were added and the solution stirred under 
hydrogen (1 atm) for 2.1 h. The slurry was filtered through celite which was washed with 
methanol. Water 250 ml was added and volitiles removed in vacuo. The aqueous layer was 

10 extracted with ether. Organic layers were combined, washed with water and brine, and dried 
over MgS04, Solvent was removed in vacuo to give a white solid ( 8.94 g, 71%) *H NMR 
(DMSO-d 6 ) 5 8.43 (s, 1H), 7.03 (s, 1H), 4.19 (q, 2H, J = 7.1 Hz), 3.70 (s, 3H), 3.67 (s, 3H), 
1.42 (s, 9H), 1.26 (t, 3H, J = 7.1); MS m/e 299.161 (M+H 299.153 calcd. for C14H22N2O5). 

15 Ethyl 4-[(benzyloxycarbonyl)amino] r -J -hydroxy- 1 -methylpyrrole-2-carboxylate Ethyl 4- 

[(/er/-butoxycarbonyl)amino]-3-methoxy-l-methylpyrrole-2-carboxylate (9.0 g, 30.2 mmol) 
was dissolved in 30 mL ethanol. NaOH (30 ml, 1 M, aq) was added and the solution stirred for 
4 days. Water (200 ml) was added and ethanol removed in vacuo. The solution was extracted 
with diethyl ether, aqueous layer acidified to pH = 2-3, and extracted again with diethyl ether. 

20 Organic layers were dried over MgS04, and solvent removed in vacuo to give a white solid (6.0 
g, 20.5 mmol, 87% based on recovered SM) *H NMR (DMSO-d6) 8 12.14 (s, 1H), 8.37 (s, 
1H), 6.98 (s, 1H), 3.69 (s, 3H), 3.66 (s, 3H), 1.42 (s, 9H); MS m/e 293.112 (M+H 293.104 
calcd. for C12H18N2O5). 

25 EXAMPLE 2: 

SOLID PHASE SYNTHESIS OF 3-HYDROXYPYRROLE POLYAMIDES. 

Cycling protocols were optimized to afford high stepwise coupling yields (>99%). 
Deprotection by aminolysis affords up to 100 mg quantities of polyamide. Solid phase 

30 polyamide synthesis protocols were modified from the in situ neutralization Boc -chemistry 
protocols. Schnolzer, M., Alewood, P., Jones, A., Alewood, D., Kent, S.B.H. report rapid in situ 
neutralization for solid phase peptide synthesis Int. J. Peptide. Protein. Res. 1992, 40, 180. 
Coupling cycles are rapid, 72 min per residue for manual synthesis or 180 min per residue for 
machine-assisted synthesis, and require no special precautions beyond those used for ordinary 

35 solid phase peptide synthesis. Manual solid phase synthesis of a pyrrole-imidazole polyamide 
consists of a dichloromethane (DCM) wash, removal of the Boc group with trifluoroacetic acid 
(TFA)/DCM/thiophenol (PhSH), a DCM wash, a DMF wash, taking a resin sample for analysis, 
addition of activated monomer, addition of DIEA if necessary, coupling for 45 min, taking a 
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resin sample for analysis, and a final DMF wash (Figure 5, Table I). In addition, the manual 
solid phase protocol for synthesis of pyrrole-imidazole polyamides has been adapted for use on 
a ABI 430A peptide synthesizer. The aromatic amine of the pyrrole and imidazole do not react 
in the quantitative ninhydrin test. Stepwise cleavage of a sample of resin and analysis by HPLC 
indicates that high stepwise yields (> 99%) are routinely achieved. 

Dicyclohexylcarbodiimide (DCC), Hydroxybenzotriazole (HOBt), 2-(lH-Benzotriazole- 
l-yl)-l,l,3,3-tetramethyluronium hexa-fluorophosphate (HBTU) and 0.2 mmol/gram Boc-p- 
alanine-(-4-carboxamidomethyl)-benzyl-ester-copoly(styrene-divinylbenzene) resin (Boc-P- 
Pam-Resin) was purchased from Peptides International (0.2 mmol/gram), NovaBiochem (0.6 
mmol/gram), or Peninsula (0.6 mmol/gram). ( (7?)-2-Fmoc-4-Boc-diaminobutyric acid, (S)-2- 
Fmoc-4-Boc-diaminobutyric acid, and (/?)-2-amino-4-Boc-diaminobutyric acid were purchased 
from Bachem. A^diisopropylethylamine (DIEA), Mtf-dimethylformamide (DMF), N- 
methylpyrrolidone (NMP), DMSO/NMP, Acetic anhydride (Ac 2 0), and 0.0002 M potassium 
cyanide/pyridine were purchased from Applied Biosystems. Dichloromethane (DCM) and 
triethylamine (TEA) were reagent grade from EM, thiophenol (PhSH), 
dimethylaminopropylamine (Dp), Sodium Hydride, (/?)-a-methoxy-a- 

(trifuoromethyl)phenylacetic acid ((/?)MPTA) and (S)-a-methoxy-ct- 

(trifouromethyl)phenylacetic acid ((5)MPTA) were from Aldrich, trifluoroacetic acid (TFA) 
Biograde from Halocarbon, phenol from Fisher, and ninhydrin from Pierce. All reagents were 
used without further purification. 

Quik-Sep polypropylene disposable filters were purchased from Isolab Inc. l U NMR 
spectra were recorded on a General Electric-QE NMR spectrometer at 300 MHz with chemical 
shifts reported in parts per million relative to residual solvent. UV spectra were measured in 
water on a Hewlett-Packard Model 8452A diode array spectrophotometer. Optical rotations 
were recorded on a JASCO Dip 1000 Digital Polarimeter. Matrix-assisted, laser 
desorption/ionization time of flight mass spectrometry (MALDI-TOF) was performed at the 
Protein and Peptide Microanalytical Facility at the California Institute of Technology. HPLC 
analysis was performed on either a HP 1090M analytical HPLC or a Beckman Gold system 
using a RAINEN C l8 , Microsorb MV, 5|im, 300 x 4.6 mm reversed phase column in 0.1% 
(wt/v) TFA with acetonitrile as eluent and a flow rate of 1.0 mL/min, gradient elution 1.25% 
acetonitrile/min. Preparatory reverse phase HPLC was performed on a Beckman HPLC with a 
Waters DeltaPak 25 x 100 mm, 100 \im CI 8 column equipped with a guard, 0.1% (wt/v) TFA, 
0.25% acetonitrile/min. 18MQ water was obtained from a Millipore MilliQ water purification 
system, and all buffers were 0.2 ^m filtered. 
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Activation of Boc-3-methoxypyrroie acid. The amino acid (0.5 mmol) was dissolved in 2 mL 
DMF. HBTU (190 mg, 0.5 mmol) was added followed by DIEA (1 mL) and the resulting 
mixture was shaken for 5 min. 

5 Activation of Imidazole-2-carboxyIic acid, y-aminobutyric acid, Boc-glycine, and Boc-P- 
alanine. The appropriate amino acid or acid (2 mmol) was dissolved in 2 mL DMF. HBTU 
(720 mg, 1.9 mmol) was added followed by DIEA (1 mL) and the solution shaken for at least 5 
min. 

10 Activation of Boe-Imidazole acid. Boc imidazole acid (257 mg, 1 mmol) and HOBt (135 mg, 
1 mmol) were dissolved in 2 mL DMF, DCC (202 mg, 1 mmol) is then added and the solution 
allowed to stand for at least 5 min. 

Acetylation Mix. 2 mL DMF, DIEA (710 juL, 4.0 mmol), and acetic anhydride (380 |iL, 4.0 
15 mmol) were combined immediately before use, 

Manual Synthesis Protocol. Boc-B-alanine-Pam-Resin (1.25 g, 0.25 mmol) is placed in a 20 
mL glass reaction vessel, shaken in DMF for 5 min and the reaction vessel drained. The resin 
was washed with DCM (2 x 30 s.) and the Boc group removed with 80% TFA/DCM/0.5M 

20 PhSH, 1 x 30s., 1 x 20 min The resin was washed with DCM (2 x 30 s.) followed by DMF (1 x 
30 s.) A resin sample (5-10 mg) was taken for analysis. The vessel was drained completely and 
activated monomer added, followed by DIEA if necessary. The reaction vessel was shaken 
vigorously to make a slurry. The coupling was allowed to proceed for 90 min, and a resin 
sample taken. Acetic anhydride (I mL) was added and the reaction shaken for 5 min. The 

25 reaction vessel was then washed with DMF, followed by DCM. 

Machine-Assisted Protocols. Machine-assisted synthesis was performed on a ABI 430A 
synthesizer on a 0.18 mmol scale (900 mg resin; 0.2 mmoygram). Each cycle of amino acid 
addition involved: deprotection with approximately 80% TFA/DCM/0.4M PhSH for 3 minutes, 

30 draining the reaction vessel, and then deprotection for 17 minutes; 2 dichloromethane flow 
washes; an NMP flow wash; draining the reaction vessel; coupling for 1 hour with in situ 
neutralization, addition of dimethyl sulfoxide (DMSO)/NMP, coupling for 30 minutes, addition 
of DIEA, coupling for 30 minutes; draining the reaction vessel; washing with DCM, taking a 
resin sample for evaluation of the progress of the synthesis by HPLC analysis; capping with 

35 acetic anhydride/DIEA in DCM for 6 minutes; and washing with DCM. A double couple cycle 
is employed when coupling aliphatic amino acids to imidazole, all other couplings are 
performed with single couple cycles. 
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The ABI 430A synthesizer was left in the standard hardware configuration for NMP- 
HOBt protocols. Reagent positions 1 and 7 were DIEA, reagent position 2 was TFA/0.5M 
thiophenol, reagent position 3 was 70% ethanol amine/methanol, reagent position 4 was acetic 
anhydride, reagent position 5 was DMSO/NMP, reagent position 6 was methanol, and reagent 
position 8 was DMF: New activator functions were written, one for direct transfer of the 
cartridge contents to the concentrator (switch list 21, 25, 26, 35, 37, 44), and a second for 
transfer of reagent position 8 directly to the cartridge (switch list 37, 39, 45, 46). 

Boc-Py-OBt ester (357 mg, 1 mmol) was dissolved in 2 ml DMF and filtered into a 
synthesis cartridge. Boc-Im acid monomer was activated (DCC/HOBt), filtered, and placed in a 
synthesis cartridge. Imidazole-2-carboxylic acid was added manually. At the initiation of the 
coupling cycle the synthesis was interrupted, the reaction vessel vented and the activated 
monomer added directly to the reaction vessel through the resin sampling loop via syringe. 
When manual addition was necessary an empty synthesis cartridge was used. Aliphatic amino 
acids (2 mmol) and HBTU (1.9 mmol) were placed in a synthesis cartridge. 3 ml of DMF was 
added using a calibrated delivery loop from reagent bottle 8, followed by calibrated delivery of 
1 ml DIEA from reagent bottle 7, and a 3 minute mixing of the cartridge. 

The activator cycle was written to transfer activated monomer directly from the cartridge to 
the concentrator vessel, bypassing the activator vessel. After transfer, 1 ml of DIEA was 
measured into the cartridge using a calibrated delivery loop, and the DIEA solution combined 
with the activated monomer solution in the concentrator vessel. The activated ester in 2:1 
DMF/DIEA was then transferred to the reaction vessel. All lines were emptied with argon 
before and after solution transfers. 

ImlmOpPy-y-ImPyPyPy-fi-Dp. ImlmOpPy-y-ImPyPyPy-P-Pam-Resin was synthesized 
in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 
mmol/g). Baird, E. E, & Dervan, P. B. describes the solid phase synthesis of polyamides 
containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also 
see PCT US 97/003332. 3-hydroxypynrole-Boc-amino acid (0.7 mmol) was incorporated by 
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge. 
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of 
ImlmOpPy-y-ImPyPyPy-P-Pam-Resin (400 mg, 0.40 mmol/ gram) was placed in a glass 20 mL 
peptide synthesis vessel and treated with neat dimethylaminopropylamine (2 mL) and heated 
(55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin, 
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC. 
ImImOpPy-y-ImPyPyPy-P-r>p is recovered upon lyophilization of the appropriate fractions as a 
white powder (97 mg, 49% recovery). UV (H 2 0) 246, 316 (66,000); ! H NMR (DMSO-<i 6 ) 
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5 10.24 (s, 1 H), 10.14 (s, 1 H), 9.99 (s, 1 H), 9.94 (s, 1 H), 9.88 (s, 1 H), 9.4 (br s, 1 H), 9.25 (s, 
1 H), 9.1 i (s, 1 H), 8.05 (m, 3 H), 7.60 (s, 1 H), 7.46 (s, I H), 7.41 (s, 1 H), 7.23 (d, I), 7.21 (d, 
1 H), 7.19 (d, 1 H), 7.13 (m, 2 H), 7.11 (m, 2 H), 7.02 (d, 1 H), 6,83 (m, 2 H), 3.96 (s, 6 H), 
3.90 (s, 3 H), 3.81 (m, 6 H), 3.79 (s, 3 H), 3.75 (d, 9 H), 3.33 (q, 2 H, J = 5.4 Hz), 3.15 (q, 2 H, 
5 7-5.5 Hz), 3.08 (q, 2 H, J = 6.0 Hz), 2.96 (quintet, 2H,7= 5.6 Hz), 2.70 (d, 6 H, J = 4.5 Hz), 
2.32 (m, 4 H), 1.71 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1253.5 (1253.6 calc. for 
C 58 H 72 N 22 On). 

ImlmHpPy-y-lmPyPyPy \ In order to remove the methoxy protecting group, a sample of 

10 ImlmOpPy-y-ImPyPyPy-p-Dp (5 mg, 3.9 ^mol) was treated with sodium thiophenoxide at 100 
°C for 2 h. DMF (1000 jiL) and thiophenol (500 nL) were placed in a (13 x 100 mm) disposable 
Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in mineral oil (100 mg) was 
slowly added. Upon completion of the addition of the sodium hydride, ImlmOpPy-y-ImPyPyPy- 
P-Dp (5 mg) dissolved in DMF (500 ^L) was added. The solution was agitated, and placed in a 

15 100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube 
was cooled to 0°C, and 7 ml of a 20 % (wt/v) solution of trifluoroacetic acid added. The 
aqueous layer is separated from the resulting biphasic solution and purified by reversed phase 
HPLC. ImlmHpPy-y-lmPyPyPy-P-Dp is recovered as a white powder upon lyophilization of 
the appropriate fractions (3.8 mg, 77 % recovery). UV (H 2 0) 246, 312 (66,000); l H NMR 

20 (DMSO-J 6 ) 5 10.34 (s, 1 H), 10.24 (s, 1 H), 10.00 (s, 2 H), 9.93 (s, 1 H), 9.87 (s, 1 H), 9.83 (s, 
1 H), 9.4 (br s, 1 H), 9.04 (s, 1 H), 8.03 (m, 3 H), 7.58 (s, 1 H), 7.44 (s, 1 H), 7.42 (s, 1 H), 7.23 
(s, I H), 7.20 (m, 3 H), 7.12 (m, 2 H), 7.05 (d, 1 H), 7.02 (d, 1 H), 6.83 (s, 1 H), 6.79 (s, 1 H), 
3.96 (s, 6 H), 3.90 (s, 3 H), 3.81 (s, 6 H), 3.79 (s, 3 H), 3.75 (d, 6 H), 3.33 (q, 2 H, J = 5.4 Hz), 
3.14 (q, 2 H,y= 5.4 Hz), 3.08 (q, 2 H,/ = 6.1 Hz), 2.99 (quintet, 2 H,7= 5.4 Hz), 2.69 (d, 6 H, 

25 J = 4.2 Hz), 2.31 (m, 4 H), 1.72 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 
calc. for C57H71N22O1]). 

ImImPyPy-y-ImOpPyPy-$-Dp. ImlmPyPy-y-ImOpPyPy-fi-Pam-Resin was synthesized 
in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 

30 mmol/g) as described for ImlmOpPy-y-ImPyPyPy-P-Dp. A sample of ImlmPyPy-y-ImOpPyPy- 
P-Pam-Resin (400 mg, 0.40 mmol/gram) was placed in a glass 20 mL peptide synthesis vessel 
and treated with neat dimethylaminopropylamine (2 mL) and heated (55 °C) with periodic 
agitation for 16 h. The reaction mixture was then filtered to remove resin, 0.1% (wt/v) TFA 
added (6 mL) and the resulting solution purified by reversed phase HPLC. ImlmPyPy-y- 

35 ImOpPyPy-p-Dp is recovered upon lyophilization of the appropriate fractions as a white 
powder (101 mg, 50% recovery). UV (H 2 0) ^ 246, 316 (66,000); MALDI-TOF-MS 
(monoisotopic), 1253.6(1253.6 calc. for Csg^^On). 
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ImlmPyPy-y-lmHpPyPy. A sample of ImlmPyPy-y-ImOpPyPy-P-Dp (5 mg, 3.9 nmol) 
was treated with sodium thiophenoxide and purified by reversed phase HPLC as described for 
ImlmHpPy-y-ImPyPyPy-P-Dp. ImlmPyPy-y-ImHpPyPy-P-Dp is recovered upon lyophilization 
of the appropriate fractions as a white powder (3.2 mg, 66 % recovery). UV (H 2 0) 246, 
5 312 (66,000); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 calc. for C 5 7H 71 N220 n ). 

ImPyPy-y-OpPyPy~$-Dp. ImPyPy-y-OpPyPy-P-Pam-Resin was synthesized in a 
stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 
mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of polyamides 

10 containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also 
see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by 
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge. 
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of 
ImPyPy-y-OpPyPy-p-Pam-Resin (400 mg, 0.45 mmol/gram) was placed in a glass 20 mL 

15 peptide synthesis vessel and treated with neat dimethylaminopropylamine (2 mL) and heated 
(55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin, 
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC. 
ImPyPy-y-OpPyPy-P-Dp is recovered upon lyophilization of the appropriate fractions as a 
white powder (45 mg, 25% recovery). UV (H 2 0) ^ ax 246, 310 (50,000); ! H NMR (DMSO-</ 6 ) 

20 5 10.45 (s, 1 H), 9.90 (s, 1 H), 9.82 (s, 1 H), 9.5 (br s, 1 H), 9.38 (s, 1 H), 9.04 (s, 1 H), 8.02 (m, 
3 H), 7.37 (s, 1 H), 7.25 (m, 2 H), 7.15 (d, 1 H, J = 1.6 Hz), 7.11 (m, 2 H), 7.09 (d, 1 H), 7.03 
(d, 1 H), 6.99 (d, 1 H), 6.87 (d, 1 H), 6.84 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 3.77 (s, 6 H), 
3.76 (s, 3 H), 3.74 (s, 1 H), 3.34 (q, 2 H, J= 5.6 Hz), 3.20 (q, 2 H, J = 5.8 Hz), 3.09 (q, 2 H, J = 
6.1 Hz), 2.97 (quintet, 2 H 9 J = 5.3 Hz), 2.70 (d, 6 H, J- 3.9 Hz). 2.34 (m, 4 H), 1.73 (m, 4 H); 

25 MALDI-TOF-MS (monoisotopic), 1007.6 (1007.5 calc. for C 4 8H 63 N 16 0 9 ). 

ImPyPy-y-HpPyPy. In order to remove the methoxy protecting group, a sample of 
ImPyPy-y-OpPyPy-p-Dp (5 mg, 4.8 jimol) was treated with sodium thiophenoxide at 100 °C 
for 2 h. DMF (1000 \iL) and thiophenol (500 |iL) were placed in a (13 x 100 mm) disposable 

30 Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in mineral oil (100 mg) was 
slowly added. Upon completion of the addition of the sodium hydride, ImlmPyPy-y-IniOpPyPy- 
p-Dp (5 mg) dissolved in DMF (500 (iL) was added. The solution was agitated, and placed in a 
100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube 
was cooled to 0°C, and 7 ml of a 20 % (wt/v) solution of trifluoroacetic acid added. The 

35 aqueous layer is separated from the resulting biphasic solution and purified by reversed phase 
HPLC. ImlmHpPy-y-ImHpPyPy-p-Dp is recovered as a white powder upon lyophilization of 
the appropriate fractions (2.5 mg, 52 % recovery). UV (H 2 0) K* x 246, 310 (50,000); ] H NMR 
(DMSO-rf 6 ) 6 10.44 (s, 1 H), 10.16 (s, 1 H), 9.90 (s, i H), 9.77 (s, 1 H), 9.5 (br s, 1 H), 9.00 (s, 
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1 H), 8.03 (m, 3 H), 7.37 (s, 1 H), 7.26 (m, 2 H), 7.14 (d, 1 H, J= 1.7 Hz), 7.12 (m, 2 H), 7.02 
(d, 1 H), 6.93 (d, 1 H), 6.88 (d, 1 H), 6.82 (d, 1 H), 6.72 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 
3.77 (s, 3 H), 3.76 (s, 3 H), 3.74 (s, 1 H), 3.36 (q, 2 H, J = 5.4 Hz), 3.22 (q, 2 H, J = 5.9 Hz), 
3.09 (q, 2 H, J = 5.5 Hz), 2.98 (quintet, 2 H, J = 5.3 Hz), 2.70 (d, 6 H, J = 4.3 Hz), 2.34 (m, 4 
5 H), 1 .78 (m, 4 H); MAL-DI-TOF-MS (monoisotopic), 994.2 (993.5 calc. for C 47 H 6 ,N 16 0 9 ). 



Table 5. Mass spectral characterization of Op and Hp polyamides, synthesized and purified as 
described for ImlmOpPy-y-ImPyPyPy-p-Dp and ImlmHpPy-y-ImPyPyPy-P-Dp. 



POLYAMIDE 


FORMULA (M+H)CALCD 


FOUND 


imOpry-y-ryryPy-p-Dp 


CdsHfi^Ni^OQ 


1007.5 


1 AA*7 ^ 


ImHpry-y-Pyryry-p-Dp 


Cd7HfitNifiOo 


993.5 




t T*> . r /"*\.-. « . D-»fD-»fTJ»» Q 

ImryOp-y-r y Py r y- p-Dp 


CirHaiNiaOo 


1007.5 


1 AA7 < 


lmryHp-y-ryryry-p-Dp 


CjtHaiNiaOo 


993.5 




ImPyPy-y-OpPyPy-p-Dp 


CjsHaiN^Oo 


1007.5 


1 AA7 /C 
iUU / .O 


IrnPyPy-y-Hpryry-p-Dp 


CjiHriNiAOo 


993.5 




ImPyPy-y-PyOpPy-p-Dp 


C 4 «HfiiNiAOQ 


1007.5 


1UU7.5 


ImPyPy-y-PyripPy-p-Dp 


CdiHAiNiAOo 


993.5 




ImOpOp-y-PyPyPy-p-Dp 


W9 n 65 iN 16^10 






ImHpHp-y-PyPyPy- (3-Dp 


C4 7 H6iNi 6 O l0 


1009.5 


1009.4 


ImlmOpPy-y-ImPyPyPy-P-Dp 


C58H72N22O11 


1253.6 


1253.5 


ImlmHpPy-y-ImPyPyPy-p-Dp 


C 57 H 7l N 22 O u 


1239.6 


1239.6 


ImlmPyPy-y-ImOpPyPy-p-Dp 


C58H72N 22 On 


1253.6 


1253.6 


ImlmPyPy-y-ImHpPyPy-P-Dp 


C 5 7H 7 iN 22 O u 


1239.6 


1239.6 


ImOpPyPy-y-ImOpPyPy-p-Dp 


C6oH76N 2 iOi2 


1282.6 


1282.6 


ImHpPyPy-y-ImHpPyPy-p-Dp 


C 58 H 72 N 2l O l2 


1254.6 


1254.6 


ImlmOpPy-y-ImOpPyPy-P-Dp 


C 5 9H 75 N 22 0 I2 


1283.6 


1283.6 


ImlxnHpPy-y-ImHpPyPy-P-Dp 


C 57 H 7l N 22 O l2 


1255.6 


1255.5 


ImOpPyPy-y-PyPyPyPy-P-Dp 


C6oH75N 2 oOn 


1251.6 


1251.5 


ImPyPyPy-y-PyPyOpPy-p-Dp 


C6oH 7 5N 20 Ou 


1251.6 


1251.5 


ImlmPyPy-y-ImPyOpPy-P-Dp 


C58H 72 N 22 On 


1253.6 


1253.7 


ImOpPyPyPy-y-ImOpPyPyPy-p-Dp 


C72H 88 N 25 0 14 


1526.7 


1526.6 


ImHpPyPyPy-y-ImHpPyPyPy-p-Dp 


C70H84N25O14 


1498.7 


1498.0 


ImlmPyPyPy-Y-ImOpOpPyPy-P-Dp 


C 7 iH 87 N 26 0 14 


1527.7 


1527.7 
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EXAMPLE 3: 
DETERMINATION OF POLY AMIDE 
BINDING AFFINITY AND SEQUENCE SPECIFICITY. 

Representative-footprint titration experiments are shown in Figures 3 and 10. A 252-bp 
DNA fragment which is typically used for the footprint titration experiments provides 247 
possible 6-bp binding sites for an eight-ring hairpin polyamide. Thus, in addition to providing 
DNA binding affinities, the footprint titration experiments also reveal DNA binding sequence- 
specificity. The DNA binding sequence specificity of polyamides which differ by a single 
Py/Py, Hp/Py, or Py/Hp pair for sites which differ by a single A*T or T # A base pair are 
described in Tables 1, 6, and 7. 

Quantitative DNase I Footprint Titrations All reactions were executed in a total volume 
of 400 |iL (Brenowitz, M. et aL, 1986). A polyamide stock solution or H 2 0 (for reference 
lanes) was added to an assay buffer containing 3'- 32 P radiolabeled restriction fragment (20,000 
cpm), affording final solution conditions of 10 mM Tris«HCl, 10 mM KC1, 10 mM MgCl 2 , 5 
mM CaCl 2 , pH 7.0, and either (i) a suitable concentration range of polyamide, or (ii) no 
polyamide (for reference lanes). The solutions were allowed to equilibrate for 24 hours at 22°C. 
Footprinting reactions were initiated by the addition of 10 |aL of a stock solution of DNase I (at 
the appropriate concentration to give -55% intact DNA) containing 1 mM dithiothreitol and 
allowed to proceed for 7 minutes at 22°C. The reactions were stopped by the addition of 50 \xL 
of a solution containing 2.25 M NaCl, 150 mM EDTA, 23 \xM base pair calf thymus DNA, and 
0.6 mg/ml glycogen, and ethanol precipitated. The reactions were resuspended in 1 x TBE/ 
80% formamide loading buffer, denatured by heating at 85°C for 15 minutes, and placed on ice. 
The reaction products were separated by electrophoresis on an 8% polyacrylamide gel (5% 
crosslinking, 7 M urea) in 1 x TBE at 2000 V for 1.5 h. Gels were dried on a slab dryer and 
then exposed to a storage phosphor screen at 22°C. 

Photostimuable storage phosphor imaging plates (Kodak Storage Phosphor Screen 
SO230 obtained from Molecular Dynamics) were pressed flat against dried gel samples and 
exposed in the dark at 22°C for 12-24 hours. A Molecular Dynamics 400S Phosphorlmager 
was used to obtain all data from the storage screens (Johnston et al., 1990). The data were 
analyzed by performing volume integration of the target site and reference blocks using the 
ImageQuant v. 3.3 software running on a Compaq Pentium 80. 

Quantitative DNase I Footprint Titration Data Analysis was performed by taking a 
background-corrected volume integration of rectangles encompassing the footprint sites and a 
reference site at which DNase I reactivity was invariant across the titration generated values for 
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the site intensities (I site ) and the reference intensity (I rcf ). The apparent fractional occupancy 
(9 app ) of the sites were calculated using the equation: 

/•it«//r«f 



= 1 



/•rtd°//rtf° 



(1) 



where I site ° and I TCf ° are the site and reference intensities, respectively, from a DNase I control 
lane to which no poly amide was added. 



The ([L] tQt , 9 app ) data were fit to a Langmuir binding isotherm (eq, 2 } n=l) by 
minimizing the difference between 0 app and 9 ru , using the modified Hill equation: 

K."[LF tot 



0 fit — 0 mux + (Gmax - 9 mm } 



(2) 



1 + Ka n [Lftot 

10 where [L, ol ] is the total polyamide concentration, K a is the equilibrium association constant, and 
0 min and 0 max are the experimentally determined site saturation values when the site is 
unoccupied or saturated, respectively. The data were fit using a nonlinear least-squares fitting 
procedure of KaleidaGraph software (v. 3.0.1, Abelbeck Software) with K a , 8 maX) and 9 mm as the 
adjustable parameters. The goodness of fit of the binding curve to the data points is evaluated 

15 by the correlation coefficient, with R > 0.97 as the criterion for an acceptable fit. Four sets of 
acceptable data were used in determining each association constant. All lanes from a gel were 
used unless a visual inspection revealed a data point to be obviously flawed relative to 
neighboring points. The data were normalized using the following equation: 

9 app - 0 m in 

8 norm = — — — (3) 



Bmin 



20 



TABLE 6 Discrimination of 5'-TGTAA-3' and 5'-TGTTA-3' 



Pair 



5'-TGTAA-3' 



5'-TGTTA-3' 



AT. 



reP 



Py/Py 



5'-T G T 

+ XK>o|or 

3'-A C A 



A-3' 



T-5' 



5'-T G T 

+><>oo)or 

3'-A C A 



A-3' 



T-5' 



2.0 



K a = 0.014 f^M 



K d = 0.007 uM 



5'-T G T 



Py/Hp 



3' -A C A 



A-3' 



5'-T G T 



•OCX •OCX 



3 ( -A C A 



A-3' 



T-5' 



0.36 



K d = 0.20 uM 



AT d = 0.56 u.M 



5"-T G T 



Hp/Py 



3*-A C A 



A-3' 



5'-T G T 



•O®^ #0®N 



T-5* 



3*-A C A 



A-3' 



T-5' 



14 



K 6 = 4.0 uM 



K, s 0,28 uM 



*The reported equilibrium dissociation constants axe the mean values 
obtained from two DNase 1 footprint titration experiments on a 3' K P 
labeled 370-bp pDEHl EcoRI/PuuIl DNA restriction fragment 13 . The 
assays were carried out at 22 °C, pH 7.0 in the presence of 10 mM 
Tris-HCl, 10 mM KO, 10 mM MgCh, and 5 mM CaCl 2 . 
"fRing pairing opposite T*A and A»T in the third position. 
^Calculated as KJ5'-TGTAA'3')/KJ5'-TGTTA-y). 
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TABLE 7 Discrimination of S'-TGTTT-J and 5'-TGATT-3" 
Pairt 5'-TGATT-3' 5'-TGTTT-3' K n fi 



Py/Py 




5*-T G T T T-3' 

3'-A C [aJ A A-5' 
K d = 0.005 \xM 



5.2 



5'-T GAT T-3' 

Hp/Py +><>OCO ; 

3*-A CgA A-5' 
AT d =0,53 nM 



5'-T G T T T-3' 

+XK>OCK 
3'-A C [A] A A-5' 

tf d = 0.008 



66 



Py/Hp 




5'-T G T T T-3' 

tea 

3'-A C [Aj A A-5' 
K 6 = 0.59 \iM 



0.56 



The reported equilibrium dissociation constants are the mean values 
obtained from two DNase I footprint titration experiments. The assays 
were carried out at 22 °C, pH 7.0 in the presence of 10 mM Tris»HCl, 
10 mM KC1, 10 mM MgCl 2 , and 5 mM CaCR 
tRing pairing opposite T*A and A»T in the third position. 
^Calculated as K d (5'-TGATT -3')/K d (5'-TGTTT-3'). 



EXAMPLE 5: 

5 PREPARATION OF A BIFUNCTIONAL Hp-Py-Im-POLY AMIDE. 

ImImOpPy-y-ImPyPyPy*$-Dp-NH2. Imlm^Py-y-ImPyPyPy-P-Pam-Resin was 

synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-fi-Pam- 
Resin (0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of 

10 polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was 
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine 
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation 
occurs. A sample of ImlmOpPy-y-ImPyPyPy-P-Pam-Resin (400 mg, 0.40 mmol/gram) was 

15 placed in a glass 20 mL peptide synthesis vessel and treated with neat 3,3 , -diamino-A r - 
methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction 
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting 
solution purified by reversed phase HPLC. ImImOpPy-y-ImPyPyPy-p-Dp-NH2 is recovered 
upon lyophilization of the appropriate fractions as a white powder (93 mg, 46% recovery). UV 

20 (H2O) Xmax 246, 316 (66,000); ! H NMR (DMSO-rftf) 5 10.34 (s, 1 H), 10.30 (br s, 1 H), 10.25 
(s, 1 H), 9.96 (s, 1 H), 9.95 (s, 1 H), 9.89 (s, 1 H), 9.24 (s, 1 H), 9.11 (s, 1 H), 8.08 (t, 1 H,J = 
5.6 Hz), 8.0 (m, 5 H), 7.62 (s, 1 H), 7.53 (s, 1 H), 7.42 (s, 1 H), 7.23 (d, 1H, J = 1.2 Hz), 7.21 
(m, 2 H), 7.15 (m, 2 H), 7.13 (d, 1 H), 7.1 1 (m, 2 H), 7.04 (d, 1 H), 6.84 (m, 3 H), 3.98 (s, 3 H), 
3.97 (s, 3 H), 3.92 (s, 3 H), 3.82 (m, 6 H), 3.80 (s, 3 H), 3.77 (d, 6 H), 3.35 (q, 2 H,J= 5.8 Hz) 

25 3.0-3.3 (m, 8 H), 2.86 (q, 2HJ= 5.4 Hz), 2.66 (d, 3 H, J = 4.5 Hz), 2.31 (m, 4 H), 1.94 
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(quintet, 2 H, J = 6.2 Hz), 1.74 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1296.0 (1296.6 
calc. forC60H78N23On) > 

ImImOpPy-y-ImPyPyPy-$~Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved 
5 in DMSO/NMP (1 mL) and DEEA (1 mL) by heating at 55 °C for 5 min. The dianhydride 
solution was added to ImImOpPy-y-ImPyPyPy-.p-NH2 (13 mg, 10 \xmo\) dissolved in DMSO 
(750 nL). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride 
hydrolyzed (0.1M NaOH, 3 mL, 55 °C, 10 min). Aqueous TFA (0.1% wt/v) was added to 
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to 
10 provide ImlmOpPy-y-ImPyPyPy-p-Dp-EDTA as a white powder upon lyophilization of the 
appropriate fractions (5.5 mg, 40% recovery). MALDI-TOF-MS (monoisotopic), 1570.9 
(1570.7 calc. for C70H92N25O18). 

ImImHpPy-y-ImPyPyPy-$-Dp-EDTA. In order to remove the methoxy protecting group. 

15 a sample of ImlmOpPy-y-ImPyPyPy-P-Dp-EDTA (5 mg, 3.1 \xmol) was treated with sodium 
thiophenoxide at 100 °C for 2 h. DMF (1000 jiL) and thiophenol (500 )iL) were placed in a (13 
x 100 mm) disposable Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in 
mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride, 
ImlmOpPy-y-ImPyPyPy-P-Dp-EDTA (5 mg) dissolved in DMF (500 \iL) was added. The 

20 solution was agitated, and placed in a 100 °C heat block, and deprotected for 2 h. Upon 
completion of the reaction the culture tube was cooled to 0°C, and 7 ml of a 20 % (wt/v) 
solution of trifluoroacetic acid added. The aqueous layer is separated from the resulting biphasic 
solution and purified by reversed phase HPLC. ImlmHpPy-y-ImPyPyPy-p-Dp-EDTA is 
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72 % 

25 recovery). UV (H2O) Wx 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1556.6 
(1556.7 calc. for C69H90N25O18). 

ImImPyPy-y-ImOpPyPy-$-Dp'NH2. ImlmPyPy-y-ImOpPyPy-p-Pam-Resin was 
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam- 

30 Resin (0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of 
polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was 
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine 
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation 

35 occurs. A sample of ImlmPyPy-y-ImOpPyPy-p-Pam-Resin (400 mg, 0.40 mmol/gram) was 
placed in a glass 20 mL peptide synthesis vessel and treated with neat 3,3 , -diamino-A L 
methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction 
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting 

30 
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solution purified by reversed phase HPLC. ImImPyPy-Y-ImOpPyPy-P-Dp-NH2 is recovered 
upon lyophilization of the appropriate fractions as a white powder (104 mg, 54% recovery). UV 
(H2O) X m ax 246, 316 (66,000); MALDI-TOF-MS (monoisotopic), 1296.6 (1296.6 calc. for 
C60H78N23O11). 

ImImPyPy-y~ImOpPyPy-$-Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved 
in DMSO/NMP (1 mL) and DIEA (1 mL) by heating at 55 °C for 5 min. The dianhydride 
solution was added to ImImPyPy-y-ImOpPyPy-P-NH2 (13 mg, 10 nmol) dissolved in DMSO 
(750 \iL). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride 
hydrolyzed (0.1M NaOH, 3 mL, 55 °C, 10 min). Aqueous TFA (0.1% wt/v) was added to 
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to 
provide ImlmPyPy-y-ImOpPyPy-fJ-Dp-EDTA as a white powder upon lyophilization of the 
appropriate fractions (5.9 mg, 42% recovery). MALDI-TOF-MS (monoisotopic), 1570.8 
(1570.7 calc. for C70H92N25O18). 

ImImPvPy-y-ImHpPyPy-$-Dp-EDTA. In order to remove the methoxy protecting group, 
a sample of ImlmPyPy-y-ImOpPyPy-P-Dp-EDTA (5 mg, 3.1 jimol) was treated with sodium 
thiophenoxide at 100 °C for 2 h. DMF (1000 \iL) and thiophenol (500 jiL) were placed in a (13 
x 100 mm) disposable Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in 
mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride, 
ImlmPyPy-Y-ImOpPyPy-P-Dp-EDTA (5 mg) dissolved in DMF (500 nL) was added. The 
solution was agitated, and placed in a 100 °C heat block, and deprotected for 2 h. Upon 
completion of the reaction the culture tube was cooled to 0°C, and 7 ml of a 20 % (wt/v) 
solution of trifluoroacetic acid added. The aqueous layer is separated from the resulting biphasic 
solution and purified by reversed phase HPLC. ImlmPyPy-y-ImHpPyPy-p-Dp-EDTA is 
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72 % 
recovery). UV (H2O) ^max 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1555.9 
(1556.7 calc. for C69H90N25O18)- 

EXAMPLE 6: 

DETERMINATION OF POLY AMIDE BINDING ORIENTATION 

Affinity cleavage experiments using hairpin polyamides modified with EDTA»Fe(II) at 
either the C-terminus or on the y-turn, were used to determine polyamide binding orientation 
and stoichiometry. The results of affinity cleavage experiments are consistent only with 
recognition of 6-bp by an 8-ring hairpin complex and rule out any extended 1:1 or overlapped 
complex formation. In addition, affinity cleavage experiments reveal hairpin formation 
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supporting the claim that it is the Hp/Py and Py/Hp pairing which form at both match and 
mismatch sites to discriminate A*T from T # A. 

Affinity cleavage reactions were executed in a total volume of 40 p.L. A stock solution of 
5 polyamide or H 2 0 was added to a solution containing labeled restriction fragment (20,000 
cpm), affording final solution conditions of 25 mM Tris-Acetate, 20 mM NaCl, 100 p.M/bp calf 
thymus DNA, and pH 7.0. Solutions were incubated for a minimum of 4 hours at 22°C. 
Subsequently, 4 of freshly prepared 100 jaM Fe(NH4) 2 (S0 4 )2 was added and the solution 
allowed to equilibrate for 20 min. Cleavage reactions were initiated by the addition of 4 }iL of 
10 100 mM dithiothrettol, allowed to proceed for 30 min at 22 °C, then stopped by the addition of 
10 \xL of a solution containing 1.5 M NaOAc (pH 5.5), 0.28 mg/mL glycogen, and 14 jiM base 
pairs calf thymus DNA, and ethanol precipitated. The reactions were resuspended in lx 
TBE/80% formamide loading buffer, denatured by heating at 85 °C for 15 min, and placed on 
ice. The reaction products were separated by electrophoresis on an 8% polyacrylamide gel 
15 (5% cross-link, 7 M urea) in lx TBE at 2000 V for 1.5 hours. Gels were dried and exposed to a 
storage phosphor screen. Relative cleavage intensities were determined by volume integration 
of individual cleavage bands using ImageQuant software. 
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EXAMPLE 7: 

IMPROVEMENT TO POLY AMIDE SEQUENCE SPECIFICITY. 

5 The polyamide^ of this invention provide improved specificity relative to existing 

polyamide technology. Turner, J. T., Baird, E. E., and Dervan, P.B. describe the recognition of 
seven base pair sequences in the minor groove of DNA by ten-ring pyrrole-imidazole 
polyamide hairpins J. Am. Chem. Soc. 1997 119, 7636. For example, quantitative DNasel 
footprint titrations reveal that the 10-ring hairpin ImPyPyPyPy-y-ImPyPyPyPy-P-Dp binds a 5*- 

10 TGTAACA-3- sequence with an equlibrium dissociation constant of 0.083 nM, and 18-fold 
specificity versus a single base mismatch site. A number of other sites are also bound on the 
252-bp DNA fragment used for the footprint titration experiments. (Figure 13). Introduction of 
a Hp/Py and Py/Hp pair in the 10-ring polyamide, ImHpPyPyPy-y-ImHpPyPyPy-P-Dp, to 
recognize a T*A and A-T within the 7-bp target sequence, increases the sequence-specificty. For 

15 example, a single base mismatch site 5'-TGGAACA-3 is discriminated by > 5000-fold (Figure 
13, Table 8). In fact all 245 7-bp mismatch sites present on the restriction fragment are 
discriminated > 5000- fold by the polyamide ImHpPyPyPy-y-ImHpPyPyPy-P-Dp (Figure 13). 
For cases where three A,T base pairs are present in succession it is preferred to substitute Py/Py 
in place of at least one Hp/Py or Py/Hp to provide for recognition of A«T and T»A at a single 

20 position. 



TABLE 8 Equilibrium dissociation constants* 

Polyamidet 5'-TGGTCA-3' 5'-TGGACA-3' AT re l* 

5*-T OfTlAfilC A-3' 5*-T G [g] T [a| C A-3' 

toopcx #OOCCK 

Py/Py +KH>OOdV +><K>OOCV ™ 

3 -A CIAJtItJg T-5* 3-A C [CJ A [tJ G T-5' 

K d = 0.083 nM AT d = 1.5 nM 

5'-T GfTlAfltlc A-3' 5*-T G fol T [a] C A-3' 

Hp/Py +XK>00®-#^ -eKK^PP®^ >5000 

3'-A CIaJtLtJg T-5* 3-A C [CJ A |xj G T-5* 

K d = 0.2 nM K d > 1000 nM 

*The reported dissociation constants are the average values obtained from three 
DNase I footprint titration experiments. The standard deviation for each data set is 
less than 15% of the reported number. Assays were carried out in the presence of 10 
mM Tris»HO, 10 mM KC1, 10 mM MgC12, and 5 mM CaC12 at pH 7.0 and 22 °C 
tRing pairing opposite T»A and A*T in the fourth position. 
^Calculated as K d (5'-TGGTACA-3') / K d (5'-TGTAACA-3'). 



25 EXAMPLE 8: 

USE OF PAIRING CODE 

There are 256 possible four base pair combinations of A, T, G, and C. Of these, there are 
a possible 240 four base pair sequences which contain at least 1 A»T or T>A base pair and 
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therefore can advantageously use an Hp/Py, or Py/Hp carboxamide binding. Polyamides 
binding to any of these sequences can be designed in accordance with the code of TABLE 2. 
Table 9 lists the sixteen eight-ring hairpin polyamides (1-16) which recognize the sixteen 5 1 - 
WGTNNW-3' sequences (W = A or T, X = A, G, C, or T). Table 10 lists the sixteen eight-ring 

5 hairpin polyamides (17-32) which recognize the sixteen 5'-WGANNW-3' sequences (17-32). 
Table 11 lists the twelve eight-ring hairpin polyamides (33-44) which recognize twelve 5'- 
WGGNNW-3' sequences which contain at least one A,T base pair. Table 1 1 lists the four eight- 
ring hairpin polyamides (G1-G4) which target the four S'-WGGNNW-S' sequences (G1-G4) 
which contain exclusively G*C base pairs. Table 12 lists the twelve eight-ring hairpin 

10 polyamides (45-56) which recognize twelve 5'-WGCNNW-3' sequences which contain at least 
one A,T base pair. Table 12 lists the four eight-ring hairpin polyamides (G5-G8) which target 
the four 5'-WGCNNW-3' sequences (G5-G8) which contain exclusively G*C base pairs. Table 
13 lists the sixteen eight-ring hairpin polyamides (57-72) which recognize the sixteen 5'- 
WTTNNW-3' sequences (57-72). Table 14 lists the sixteen eight-ring hairpin polyamides (73- 

15 88) which recognize the sixteen 5'-WTANNW-3' sequences (73-88). Table 15 lists the sixteen 
eight-ring hairpin polyamides (89-104) which recognize the sixteen 5'-WTGNNW-3' sequences 
(89-104). Table 16 lists the sixteen eight-ring hairpin polyamides (105-120) which recognize 
the sixteen 5'-WTCNNW-3' sequences (105-120). Table 17 lists the sixteen eight-ring hairpin 
polyamides (121-136) which recognize the sixteen S'-WATNNW-S' sequences (121-136). 

20 Table 18 lists the sixteen eight-ring hairpin polyamides (137-152) which recognize the sixteen 
5 , -WAANNW-3' sequences (137-152). Table 19 lists the sixteen eight-ring hairpin polyamides 
(153-168) which recognize the sixteen S'-WAGNNWO* sequences (153-168). Table 20 lists 
the sixteen eight-ring hairpin polyamides (169-184) which recognize the sixteen 5'-WACNNW- 
3' sequences (169-184). Table 21 lists the sixteen eight-ring hairpin polyamides (185-200) 

25 which recognize the sixteen 5'-WCTNNW-3' sequences (185-200). Table 22 lists the sixteen 
eight-ring hairpin polyamides (201-216) which recognize the sixteen 5'-WCANNW-3' 
sequences (201-216). Table 23 lists the twelve eight-ring hairpin polyamides (217-228) which 
recognize the twelve 5'-WCGNNW-3* sequences which contain at least one A,T base pair. 
Table 23 lists the four eight-ring hairpin polyamides (G9-G12) which target the four 5'- 

30 WCGNNW-S' sequences (G9-G12) which contain exclusively OG base pairs. Table 24 lists 
the twelve eight-ring hairpin polyamides (229-240) which recognize the twelve S'-WCCNKW- 
3' sequences which contain at least one A,T base pair. Table 24 lists the four eight-ring hairpin 
polyamides (G13-G16) which target the four 5'-WCCNNW-3' sequences (G13-G16) which 
contain exclusively OG base pairs. 
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TABLE 9: 8-ring Hairpin Polyamidcs for recognition of 6-bp 5*-WGTNNW-3' 



5 _ 






DNA sequence 


aromatic amino acid sequence 




1) 




-W G T T T w-3 • 


1) ImHpHpHp - y - PyPy Py Py 




2) 


5' 


-W G T T A W-3 1 


2) ImHpHpPy-y-HpPyPyPy 


10 


3) 


5' 


-W G T T G W-3 1 


3) ImHpHpIm-y-PyPyPyPy 




4) 


5' 


-W G T T C W-3 1 


4) ImHpHpPy-y- ImPyPyPy 


15 


3 / 


5 1 


-W G T A T W-3 1 


5) ImHpPyHp-y-PyHpPyPy 


6) 


5' 


-W G T A A W-3 1 


6) imHpPyPy-y-HpHpPyPy 




7) 


5' 


-W G T A G W-3 1 


7) imHpPylm-y-PyHpPyPy 


20 


8) 


5' 


-W G T A C W-3 1 


8) ImHpPyPy-y-lmHpPyPy 




9) 


5* 


-W G T G T W-3 ' 


9) imHpImHp-y-PyPyPyPy 




10) 


5» 


-W G T G A W-3 1 


10) ImHpImPy-y-HpPyPyPy 


25 


1 1 \ 

XX / 


5 1 


-W G T G G W-3 ' 


11) imHpImlm-y-PyPyPyPy 




12) 


5' 


' -W G T G C W-3 1 


12) ImHpImPy-y-lmPyPyPy 


30 


13) 


5 


1 -W G T C T W- 3 1 


13) ImHpPyHp-y-PylmPyPy 




14) 


5 


' -W G T C A W-3 1 


14) ImHpPyPy-y-HpImPyPy 


35 


15) 


5 


* -W G T C G W-3 1 


15) ImHpPylm-y-PylmPyPy 


16) 


5 


1 -W G T C C W- 3 1 


16) imHpPyPy-y-lmlmPyPy 
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TABLE 10: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGANNW-3' 



10 



15 



20 



25 



30 



DNA sequence 



35 



17) 5'-W G A T T W-3 

18) 5 1 -W G A T A W-3 

19) 5 « -W G A T G W-3 

20) 5 1 -W G A T C W-3 

21) 5 1 -W G A A T W-3 

22) 5 1 -W G A A A W-3 

23) 5 ' -W G A A G W-3 

24) 5 1 -W G A A C W-3 

25) 5 ' -W G A G T W-3 

26) 5 1 -W GAGA W-3 

27) 5 1 -W G A G G W-3 

28) 5 1 -W G A G C W-3 

29) 5 1 -W G A C T W-3 

30) 5 1 -W G A C A W-3 

31) 5 1 -W G A C G W-3 

32) 5 1 -W G A C C W-3 



aromatic amino acid sequence 



17) ImPyHpHp-y-PyPyHpPy 

18) ImPyHpPy-y-HpPyHpPy 

19) ImPyHpIm-y-PyPyHpPy 

20) ImPyHpPy-y- ImPyHpPy 

21) ImPyPyHp-y-PyHpHpPy 

22) ImPyPyPy-y-HpHpHpPy 

23) ImPyPylm-y-PyHpHpPy 

24) ImPyPyPy-y- ImHpHpPy 

25) ImPylmHp-y-PyPyHpPy 

26) ImPylmPy-y-HpPyHpPy 

27) ImPylmlm-y- PyPyHpPy 

28) ImPylmPy-y- ImPyHpPy 

29) ImPyPyHp-y-PylmHpPy 

30) ImPyPyPy-y-HpImHpPy 

31) ImPyPylm-y-PylmHpPy 

32) ImPyPyPy-y- imlmHpPy 
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TABLE 1 1 : 8-ring Hairpin Poly amides for recognition of 6-bp 5'-WGGNNW-3 T 





DNA sequence 


aromatic amino acid sequence 


33) 


5 1 


-W G G T T W- 3 ' 


33) ImlmHpHp-y-PyPyPyPy 


34) 


5 1 


-W G G T A W-3 ■ 


34) ImlmHpPy-y-HpPyPyPy 


35) 


5 1 


-W G G T G W-3 1 


35) ImlmHpIm-y-PyPyPyPy 


36) 


5 1 


-W G G T C W- 3 1 


36) ImlmHpPy-y-ImPyPyPy 


37) 


5 1 


-W G G A i W-J 


37) ImlmPyHp-y-PyHpPyPy 


38) 


c i 


— Ti V? VJ A A rf J 


38) ImlmPyPy-y-HpHpPyPy 


39) 


C I 
D 


— W \3 V» A Vj W — J 


39) ImlmPylm-y-PyHpPyPy 


40) 


K 1 


— ri \3 o A v* rf — J 


40) ImlmPyPy-y-ImHpPyPy 


41) 


C I 

o 


m n n n T W — 1 
- ri \j \j \y 1 ri — j 


41) ImlmlmHp-y-PyPyPyPy 


42) 


ec i 
D 


— W b u u n rl — J 


42) ImlmlmPy-y-HpPyPyPy 


43) 


5' 


1 -W G G C T W-3 ■ 


43) ImlmPyHp-y-PylmPyPy 


44) 


5 


1 -W G G C A W-3 1 


44) ImlmPyPy-y-HpImPyPy 


Gl) 


5 


1 -W G G G G W-3 ■ 


Gl) Imlmlmlm-y-PyPyPyPy 


G2) 


5 


' -W G G G C W-3 1 


G2) ImlmlmPy-y-ImPyPyPy 


G3) 


5 


• -W G G C G W-3 1 


G3) ImlmPylm-y-PylmPyPy 


G4) 


5 


1 -W G G C C W-3 » 


G4) ImlmPyPy-y-ImlmPyPy 
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TABLE 12: 8-ring Hairpin Polyamides for recognition of 6-bp 5*-WGCNNW-3' 



DNA sequence 



10 



15 



20 



25 



30 



aromatic amino acid sequence 



35 



45) 5 

46) 5 

47) 5 

48) 5 

49) 5 

50) 5 

51) 5 

52) 5 

53) 5 

54) 5 

55) 5 

56) 5 
G5) 5 
G6) 5 
G7) 5 
G8) 5 



-W G C T T W-3 
-W G 'C T A W-3 
-W G C T G W-3 
-W G C T C W-3 
-W G C A T W-3 
-W G C A A W-3 
-W G C A G W-3 
-W G C A C W-3 
-W G C G T W-3 
-W G C G A W-3 
-W G C C T W-3 
-W G C C A W-3 
-W G C G G W-3 
-W G C G C W-3 
^ G C C G W-3 
-W G C C C W-3 



45) ImPyHpHp-y-PyPylmPy 

46) ImPyHpPy-y-HpPylmPy 

47) ImPyHpIm-y-PyPylmPy 

48 ) ImPyHpPy-y- ImPylmPy 

49) ImPyPyHp-y-PyHpImPy 

50) ImPyPyPy-y-HpHpImPy 

51) ImPyPylm-y-PyHpImPy 

52) ImPyPyPy-y- ImHpImPy 
5 3 ) ImPylmHp-y-PyPylmPy 

54) ImPylmPy-y-HpPylmPy 

55) ImPyPyHp-y-PylmlmPy 

56) ImPyPyPy-y-HpImlmPy 
G5) ImPylmlm-y-PyPylmPy 
G6) ImPylmPy-y-ImPylmPy 
G7) ImPyPylm-y- PylmlmPy 
G8) ImPyPyPy-y- ImlmlmPy 



BNSOOC1D: <WO 9837066A 1 _L> 



38 



I 



WO 98/37066 



PCT/US98/01006 



TABLE 13: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTTNNW-3' 



DNA sequence 



aromatic amino acid sequence 



57) 5 1 -W T T T T W-3 

58) 5 1 -W T T T A W-3 

59) 5 1 -W T T T G W-3 

60) 5 ' -W T T T C W-3 

61) 5 1 -W T T A T W-3 

62) 5 * -W T T A A W-3 

63) 5 1 -W T T A G W-3 
.64) 5* -W T T A C W-3 

65) 5 1 -W T T G T W-3 

66) 5 1 -W T T G A W-3 • 

67) 5 ' -W T T G G W-3 " 

68) 5 1 -W T T G C W-3 • 

69) 5 ' -W T T C T W-3 • 

70) 5 ■ -W T T C A W-3 1 

71) 5 ' -W T T C G W-3 ' 

72) 5 1 -W T T C C W-3 ' 



5 7 ) HpHpHpHp -y-PyPyPyPy 
5 8 ) HpHpHpPy -y - KpPy PyPy 

5 9 ) HpHpHpIm-y- PyPyPyPy 

6 0 ) HpHpHp Py-y- 1 mPy Py Py 
6 1 ) HpHpPyHp -y - PyHpPy Py 
6 2 ) HpHpPy Py -y - HpHpPyPy 

63) HpHpPyIm-y-PyHpPyPy 

64) HpHpPyPy-y-ImHpPyPy 
6 5 ) HpHp ImHp - y - Py Py Py Py 

66) HpHpImPy-y-HpPyPyPy 

67) HpHpImlm-y-PyPyPyPy 

68) HpHpImPy-y-ImPyPyPy 
6 9 ) HpHp Py Hp - y - Py I mPy Py 

70) HpHpPyPy-y-HpImPyPy 

71) HpHpPylm-y-PylmPyPy 

72 ) KpHpPyPy-y- ImlmPyPy 



39 



WO 98/37066 PCTYUS98/01006 



TABLE 14: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTANNW-3' 



DNA sequence 



aromatic amino acid sequence 



10 



15 



20 



25 



30 



35 



73) 5 

74) 5 

75) 5 

76) 5 

77) 5 

78) 5 

79) 5 

80) 5 

81) 5 

82) 5 

83) 5 

84) 5 

85) 5 

86) 5 

87) 5 

88) 5 



-W T A T T W-3 
-W TATA W-3 
-W T A T G W-3 
-W T A T C W-3 
-W T A A T W-3 
-W T A A A W-3 
-W T A A G W-3 
-W T A A C W-3 
-W T A G T W-3 
-W T A G A W-3 
-W T A G G W-3 
-W T A G C W-3 
-W TACT W-3 
-W T A C A W-3 
-W T A C G W-3 
-W T A C C W-3 



73) HpPyHpHp-y-PyPyHpPy 

74 ) HpPyHpPy-y-HpPyHpPy 

75) HpPyHpIm-y-PyPyHpPy 
7 6 ) Hp Py Hp Py - y - 1 m Py Hp Py 
7 7 ) Hp Py Py Hp - y - PyHpHp Py 
78) HpPyPyPy-y-HpHpHpPy 

7 9 ) Hp Py Py Im - y - Py HpHp Py 

80) HpPyPyPy-y-ImHpHpPy 

8 1 ) Hp Py I mHp - y - Py Py Hp Py 

82 ) HpPylmPy-y-HpPyHpPy 

83 ) HpPylmlm-y-PyPyHpPy 

84 ) HpPylmPy-y - ImPyHpPy 

8 5 ) HpPy PyHp - y - Py ImHpPy 

86) HpPyPyPy-y-HpImHpPy 

87) HpPyPylm-y-PylmHpPy 

88 ) HpPyPyPy-y- ImlmHpPy 



BNSDOCID: <WO 9837066A 1 J _> 



40 



WO 98/37066 



PCT/US98/01006 



TABLE IS: 8-ring Hairpin Polyamides for recognition of 6-bp 5 T -WTGNNW-3' 



DNA sequence 



aromatic amino acid sequence 



10 



15 



20 



25 



30 



35 



89) 

90) 

91) 

92) 

93) 

94) 

95) 

96) 

97) 

98) 

99) 

100) 

101) 

102) 

103) 

104) 



-W T G T T W-3 
-W T G T A W-3 
-W T G T G W-3 
-W T G T C W-3 
-W T G A T W-3 
-W T G A A W-3 
-W T G A G W-3 
-W T G A C W-3 
-W T G G T W-3 
-W T G G A W-3 
-W T G C T W-3 
-W T G C A W-3 
-W T G G G 
-W T G G C W-3 
-W T G C G W-3 
-W T G C C W-3 



89) HpImHpHp-y-PyPyPyPy 

90) HpImHpPy-y-HpPyPyPy 

91) HpImHpIm-y-PyPyPyPy 

92) HpImHpPy-y- ImPyPyPy 
9 3 ) HpImPyHp -y- PyHpPyPy 

94) HpImPyPy-y-HpHpPyPy 

95) HpImPylm-y- PyHpPyPy 

96) HpImPyPy-y-ImHpPyPy 

97) HpImlmHp-y-PyPyPyPy 

98) HpImlmPy-y-HpPyPyPy 

99) HpImPyHp-y- PylmPyPy 

100) HpImPyPy-y-HpImPyPy 

101) HpImlmlm-y-PyPyPyPy 

102 ) HpImlmPy-y- ImPyPyPy 

103) HpImPylm-y- PylmPyPy 

104) HpImPyPy-y-ImlmPyPy 



4) 
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TABLE 16: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTQ^NW-3' 



10 



15 



20 



25 



30 



35 



DNA sequence 

105) 5 1 -W T C T T W-3 

106) 5 1 -W T C T A W-3 

107) 5 1 -W T C T G W-3 

108) 5 ' -W T C T C W-3 

109) 5 1 -W T C A T W-3 

110) 5 • -W T C A A W-3 

111) 5 1 -W T C A G W-3 

112) 5 1 -W T C A C W-3 

113) 5 ' -W T C G T W-3 

114) 5 1 -W T C G A W-3 

115) 5 1 -W T C C T W-3 

116) 5 1 -W T C C A W-3 

117) 5 1 -W T C G G W-3 

118) 5 1 -W T C G C W-3 

119) 5 1 -W T C C G W-3 

120) 5 1 -W T C C C W-3 



aromatic amino acid sequence 

1 0 5 ) Hp PyHpHp -y-Py Py ImPy 

106) HpPyHpPy-y-HpPylmPy 

107 ) HpPyHpIm-y-PyPylmPy 

108 ) HpPyHpPy -y- ImPylmPy 
10 9) HpPyPyHp-y- PyHpImPy 

110 ) HpPyPyPy-y-HpHpImPy 

111) HpPyPy Im -y - PyHpImPy 

112 ) HpPyPyPy-y- ImHpImPy 

113 ) HpPylmHp-y- PyPylmPy 

114 ) HpPy ImPy-y-HpPylmPy 

115) HpPyPyHp-y-PylmlmPy 

116) HpPyPyPy-y-HpImlmPy 

117 ) HpPylmlm-y-PyPylmPy 

118 ) HpPylmPy-y- ImPylmPy 

119) HpPyPylm-y-PylmlmPy 
12 0) HpPyPyPy-y - ImlmlmPy 
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TABLE 17: 8-ring Hairpin Polyamides for recognition of 6-bp 5*-WATNNW-3' 



DNA sequence 



121) 5 1 -W A T T T W-3 1 

122) 5 1 -W A T T A W-3 1 

123) 5 1 -W A T T G W-3 ' 

124) 5 ■ -W A T T C W-3 1 

125) 5 1 -W A T A T W-3 1 

126) 5 • -W A T A A W-3 » 

127) 5 1 -W A T A G W-3 ■ 

128) 5 1 -W A T A C W-3 1 

129) 5 1 -W A T G T W-3 1 

130) 5 1 -W A T G A W-3 1 

131) 5 1 -W A T G G W-3 1 

132) 5 1 -W A T G C W-3 ' 

133) 5 1 -W A T C T W-3 • 

134) 5 1 -W A T C A W-3 1 

135) 5 ' -W A T C G W-3 1 

136) 5 1 -W A T C C W-3 • 



aromatic amino acid sequence 

121) PyHpHpHp-y-PyPyPyHp 

122) PyHpHpPy-y-HpPyPyHp 
12 3) PyHpHpIm-y- PyPyPyHp 
124) PyHpHpPy-y-ImPyPyHp 
12 5) PyHpPyHp-y- PyHp PyHp 
126) PyHpPyPy-y-HpHpPyHp 
12 7) PyHpPylm-y-PyHpPyHp 
12 8 ) PyHpPyPy-y- ImHpPyHp 

12 9) PyHp ImHp - y - Py Py PyHp 

130) PyHpImPy-y-HpPyPyHp 

131) PyHpImlm-y-PyPyPyHp 

13 2) PyHp ImPy - y - 1 mPy PyHp 

133) PyHpPyHp-y-PylmPyHp 

134) PyHpPyPy-y-HpImPyHp 

135) PyHpPylm-Y-PylmPyHp 
13 6) PyHpPyPy-y-ImlmPyHp 
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TABLE 18: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WAANNW-3' 



DNA sequence 



aromatic amino acid sequence 



10 



15 



20 



25 



30 



137) 5 

138) 5 

139) 5 

140) 5 

141) 5 

142) 5 

143) 5 

144) 5 

145) 5 

146) 5 

147) 5 

148) 5 

149) 5 

150) 5 

151) 5 

152) 5 



-W A A T T W-3 
-W A A T A W-3 
-W A A T Q W-3 
-W A A T C W-3 
-W A A A T W-3 
-W A A A A W-3 
-W A A A G W-3 
-W A A A C W-3 
-W A A G T W-3 
-W A A G A W-3 
-W A A G G W-3 
-W A A G C W-3 
-W A A C T W-3 
-W A A C A W-3 
~W A A C G W-3 
-W A A C C W-3 



13 7) PyPyHpHp -y - PyPyHpHp 

13 8) PyPyHpPy-y-HpPyHpHp 

13 9) PyPyHpIm-y- PyPyHpHp 

14 0) PyPyHpPy-y-ImPyHpHp 

141) PyPyPyHp-y-PyHpHpHp 

142) PyPyPyPy-y-HpHpHpHp 
14 3) PyPyPylm-y-PyHpHpHp 

144 ) PyPyPyPy-y- ImHpHpHp 

145) PyPylmHp-y- PyPyHpHp 
14 6) PyPylmPy-y-HpPyHpHp 
147 ) PyPylmlm-y- PyPyHpHp 
14 8) PyPylmPy-y-ImPyHpHp 
14 9) PyPyPyHp-y-PylmHpHp 

150) PyPyPyPy-y-HpImHpHp 

151) PyPyPylm-y- PylmHpHp 

152 ) PyPyPyPy-y- ImlmHpHp 
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TABLE 19: 8-ring Hairpin Polyamides for recognition of 6-bp S^WAGNNW-3* 
DNA sequence aromatic amino acid sequence 



5 


153) 


5' 


-WAG 


T 


T W-3 ' 


153) PylmHpHp-y-PyPyPyHp 




154) 


5' 


-WAG 


T 


A W-3 1 


154) PylmHpPy-y-HpPyPyHp 


10 


155) 


5' 


-WAG 


T 


G W-3 ' 


15 5) Py imHp Im-y- Py Py PyHp 


156) 


5' 


-WAG 


T 


C W-3 1 


156) PylmHpPy-y- ImPyPyHp 




157) 


5' 


-WAG 


A 


T W-3' 


157) PylmPyHp-y-PyHpPyHp 


15 


158) 


5' 


'-WAG 


A 


A W-3 • 


158) PylmPyPy-y-HpHpPyHp 




159) 


5 1 


'-WAG 


A 


G W-3 1 


159) PylmPylm-y-PyHpPyHp 




160) 


5 


'-WAG 


A 


C W-3 ' 


160) PylmPyPy-y-ImHpPyHp 


20 


161) 


5 


•-WAG 


G 


T W-3 1 


161) PylmlmHp-y-PyPyPyHp 




162) 


5 


'-WAG 


G 


A W-3 ' 


162) PylmlmPy-y-HpPyPyHp 


25 


163) 


5 


'-WAG 


C 


T W-3 ' 


16 3) Py ImPyHp -y- PylmPyHp 




164) 


5 ' 


-WAG 


C 


A W-3 ' 


164) PylmPyPy-y-HpImPyHp 




165) 


5' 


-WAG 


G 


G W-3 1 


165) Pylmlmlm-y-PyPyPyHp 


30 


166) 


5 ' 


-WAG 


G 


C W-3 1 


166) PylmlmPy-y-ImPyPyHp 




167) 


5' 


-WAG 


C 


G W-3 1 


167) PylmPylm-y- PylmPyHp 


35 


168) 


5' 


-WAG 


C 


C W-3 ' 


16 8) PylmPyPy-y-ImlmPyHp 
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TABLE 20: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WACNNW-3' 
DNA sequence aromatic amino acid sequence 



10 



15 



20 



25 



30 



35 



169) 5 

170) 5 

171) 5 

172) 5 

173) 5 

174) 5 

175) 5 

176) 5 

177) 5 

178) 5 

179) 5 

180) 5 

181) 5 

182) 5 

183) 5 

184) 5 



-W A C T T W-3 
-W ACTA W-3 
-W A C T G W-3 
-W A C T C W-3 
-W A C A T W-3 
-W A C A A W-3 
-W A C A G W-3 
-W A C A C W-3 
-W A C G T W-3 
-W A C G A W-3 
-W A C C T W-3 
-W A C C A W-3 
-W A C G G W-3 
-W A C G C W-3 
-W A C C G W-3 
-W A C C C W-3 



169) PyPyHpHp-y-PyPylmHp 

170) PyPyHpPy-y-HpPylmHp 

171) PyPyHpIm-y-PyPylmHp 

172) PyPyHpPy-y- ImPylmHp 

173) PyPyPyHp-y-PyHpImHp 

174) PyPyPyPy-y-HpHpImHp 

175) PyPyPylm-y-PyHpImHp 

176) PyPyPyPy-y- ImHpImHp 

177) PyPylmHp-y-PyPylmHp 

178) PyPylmPy-y-HpPylmHp 

179) PyPyPyHp-y-PylmlmHp 

180) PyPyPyPy-y-HpImlmHp 

181) PyPylmlm-y-PyPylmHp 

182 ) PyPylmPy-y- ImPylmHp 

183) PyPyPylm-y-PylmlmHp 

184 ) PyPyPyPy-y- ImlmlmHp 
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TABLE 21 : 8-ring Hairpin Polyamides for recognition of 6-bp 5 T -WCTNNW-3 T 
DNA sequence aromatic amino acid sequence 



5 


185) 


5' 


-W 


c 


T 


T 


T 


W-3 ' 


185) PyHpHpHp-y-PyPyPylm 




186) 


5' 


-W 


c 


T 


T 


A 


W-3 • 


186) PyHpHpPy-y-HpPyPylm 


10 


187) 


5' 


-W 


c 


T 


T 


G 


W-3 1 


187) PyHpHpIm-y-PyPyPylm 


188) 


5' 


-W 


c 


T 


T 


C 


W-3 1 


18 8) PyHpHpPy-y-imPyPylm 




189) 


5« 


-W 


c 


T 


A 


T 


W-3 ■ 


189) PyHpPyHp-y-PyHpPylm 


15 


190) 


5' 


-W 


c 


T 


A 


A 


W-3 • 


190) PyHpPyPy-y-HpHpPylm 




191) 


5' 


-W 


c 


T 


A 


G 


W-3 ' 


191) PyHpPylm-y-PyHpPylm 


20 


192) 


5' 


-W 


c 


T 


A 


C 


W-3 * 


192) PyHpPyPy-y-ImHpPylm 


193) 


5' 


-W 


c 


T 


G 


T 


W-3 1 


193) PyHpImHp-y-PyPyPylm 








- w 


c 


T 


G 


A 


W-3 1 


194) PyHpImPy-y-HpPyPylm 


25 


195) 


5 1 


-w 


c 


T 


G 


G 


W-3 • 


195) PyHpImlm-y-PyPyPylm 




196) 


5' 


-w 


c 


T 


G 


C 


W-3 ' 


196) PyHpImPy-y-ImPyPylm 


30 


197) 


5 ' 


-w 


c 


T 


C 


T 


W-3 * 


1 97 ) PyHpPyHp -y - PylmPy Im 




198) 


5' 


-w 


c 


T 


C 


A 


W-3 ' 


198) PyHpPyPy-y-HpImPylm 




199) 


5 


-w 


c 


T 


C 


G 


W-3 ' 


199) PyHpPylm-y-PylmPylm 


35 


200) 


5 


-w 


c 


T 


c 


C 


W-3 ' 


200) PyHpPyPy-y- ImlmPylm 
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TABLE 22: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCANNW-3 ' 



10 



15 



20 



25 



30 



DNA sequence 



aromatic amino acid sequence 



35 



201) 5 

202) 5 

203) 5 

204) 5 

205) 5 

206) 5 

207) 5 

208) 5 

209) 5 

210) 5 

211) 5 

212) 5 

213) 5 

214) 5 

215) 5 

216) 5 



-W C A T T W-3 
-W CA T A W-3 
-W C A T G W-3 
-W C A T C W-3 
-W C A A T W-3 
-W C A A A W-3 
-W C A A G W-3 
-W C A A C W-3 
-W C A G T W-3 
-W C A G A W-3 
-W C A G G W-3 
-W C A G C W-3 
-W C A C T W-3 
-W C A C A W-3 
-W C A C G W-3 
-W C A C C W-3 



201) PyPyHpHp-y-PyPyHpIm 

202) PyPyHpPy-y-HpPyHpIm 

203) PyPyHpIm-y-PyPyHpIm 

204) PyPyHpPy-y-ImPyHpIm 

205) PyPyPyHp-y-PyHpHpIm 

206) PyPyPyPy-y-HpHpHpIm 

207) PyPyPylm-y-PyHpHpIm 
2 08) PyPyPyPy-y-ImHpHplTTi 

209) PyPylmHp-y-PyPyHpIm 

210) PyPylmPy-y-HpPyHpIm 

211) PyPylmlm-y-PyPyHpIm 

212) PyPylmPy-y- ImPyHpIm 

213 ) PyPyPyHp-y-PylmHpIm 

214 ) PyPyPyPy-y-HpImHpIm 

215) PyPyPylm-y-PylmHpIm 

216) PyPyPyPy-y- ImlmHpIm 
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TABLE 23: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCGNNW-3' 
DNA sequence aromatic amino acid sequence 



217) 


5' 


-W 


c 


G 


T 


T 


W-3 • 


217) PylmHpHp-y-PyPyPylm 


218) 


5 • 


-w 


~c 


G 


T 


A 


W-3 « 


218) PylmHpPy-y-HpPyPylm 


219) 


5' 


-w 


c 


G 


T 


G 


W-3 1 


219) PylmHpIm-y-PyPyPylm 


220) 


5* 


-w 


c 


G 


T 


C 


W-3 ■ 


220) PylmHpPy-y- ImPyPylm 


221) 


5' 


-w 


c 


G 


A 


T 


W-3 1 


221) PylmPyHp-y-PyHpPylm 


222) 


5' 


-w 


c 


G 


A 


A 


W-3 1 


222) PylmPyPy-y-HpHpPylm 


223) 


5 1 


' -w 


c 


G 


A 


G 


W-3 1 


223) PylmPylm-y-PyHpPylm 


224) 


5' 


• -w 


c 


G 


A 


C 


W-3 1 


224) PylmPyPy-y-ImHpPylm 


225) 


5 


• -w 


c 


G 


G 


T 


W-3 ■ 


225) PylmlmHp-y-PyPyPylm 


226) 


5 


' -w 


c 


G 


G 


A 


W-3 ' 


226) PylmltnPy-y-HpPyPylm 


227) 


5 


• -w 


c 


G 


C 


T 


W-3 ' 


227) PylmPyHp-y-PylmPylm 


228) 


5' 


-w 


c 


G 


C 


A 


W-3 1 


228) PylmPyPy-y-HpImPylm 


G9) 


5' 


-w 


c 


G 


G 


G 


W-3 1 


G9) Pylmlmlm-y-PyPyPylm 


G10) 


5 ' 


-w 


c 


G 


G 


C 


W-3 • 


G10) PylmlmPy-y-ImPyPylm 


Gil) 


5' 


-w 


c 


G 


C 


G 


W-3 1 


Gil) PylmPylm-y-PylmPylm 


G12) 


5 • 


-w 


c 


G 


C 


C 


W-3 1 


G12 ) PylmPyPy-y- ImlmPylm 
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TABLE 24: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCCNNW-3 T 
DNA sequence aromatic amino acid sequence 



5 


229) 


5* 


-W 


c 


c 


T 


T 


W-3 « 


229}PyPyHpHp-y- 


PyPylmlm 




230) 


5 1 


-w 


c- 


c 


T 


A 


W-3 ■ 


230) PyPyHpPy-y- 


HpPylmlm 


10 


231) 


5 ' 


-W 


Q 


Q 


T 


G 


W-3 1 


Oii ) PvPvHnTm-v- 


try try i. Ill X ill 




232) 


5 • 


-w 


c 


c 


T 


C 


W-3 • 


232) PyPyHpPy-y- 


ImPyltnlm 




233) 


5' 


-w 


c 


c 


A 


T 


W-3 ' 


23 3) PyPyPyHp-y- 


PyHpImlm 


15 


234) 


5 » 


-w 


c 


c 


A 


A 


W-3 ■ 


234) PyPyPyPy-y- 


HpHpImlm 




235) 


5* 


-w 


c 


c 


A 


G 


W-3 ' 


235) PyPyPylm-y- 


PyHpImlm 


20 


236) 


5 1 


-W 


Q 


Q 




C 


n — j 


£. j o } try try try try j 


x. nuri^j x ill J. in 




237) 


5 ' 


-w 


c 


c 


G 


T 


W-3 • 


237) PyPylmHp-y- 


PyPylmlm 




238) 


5 


-w 


c 


c 


G 


A 


W-3 ' 


238 ) PyPylmPy-y- 


•HpPylmlm 


25 


239) 


5 


-w 


c 


c 


C 


T 


W-3 1 


2 3 9) PyPyPyHp-y- 


■Pylmlmlm 




240) 


5 


-w 


c 


c 


c 


A 


W-3 ■ 


2 4 0) PyPyPyPy-y- 


-Hplmlmlm 


30 


G13) 


5 


-w 


c 


c 


G 


G 


W-3 ' 


G13) PyPylmlm-y- 


-PyPylmlm 




G14) 


5 


' -w 


c 


c 


G 


C 


W-3' 


G14 ) PyPylmPy-y 


- ImPylmlm 




G15) 


5 


' -w 


c 


c 


C 


G 


W-3 ' 


G15) PyPyPylm-y 


-Pylmlmlm 


35 


G16) 


5 


• -w 


c 


c 


C 


C 


W-3 » 


G16 } PyPyPyPy-y 


- Imlmlmlm 



EXAMPLE 9: 

Aliphatic/Aromatic amino acid pairing for recognition of the DNA minor groove. 

40 

Selective placement of an aliphatic P-alanine (p) residue paired side-by-side with either 
a pyrrole (Py) or imidazole (Im) aromatic amino acid is found to compensate for sequence 
composition effects for recognition of the minor groove of DNA by hairpin pyrrole-imidazole 
polyamides. A series of polyamides were prepared which contain pyrrole and imidazole 
45 aromatic amino acids, as well as y-aminobutyric acid (y) li tuirT and p-alanine "spring" aliphatic 
amino acid residues. The binding affinities and specificities of these polyamides are regulated 
by the placement of paired p/p Py/p and Im/p residues. Quantitative footprint titrations 
demonstrate that replacing two Py/Py pairings in a 12-ring hairpin (6-y-6) with two Py/p 
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pairings affords 1 0-fold enhanced affinity and similar sequence specificity for an 8-bp target 
sequence. 

Table 25 Equilibrium association constants (M~ * ) for polyamides. qc 



Polyamide 5' 


TP'I't' A A Z* 1 A 1' 

- rGTTAAt-A-J 


j'lul uAALA-J 




A ^ ^-i <^ /-v f-\ 


2.5 x 10 9 


3.9 x 10 8 


6 




1.3 x 10 9 


2.0 x 10 8 


7 




1.7 x IU 






KHHXKX 


1.2 x 10" 


2.2 x 10 9 


55 




6.6 x 10 9 


2.5 x 10* 


26 


•-0-OOOCK 


4.5 x 10 10 


7.7 x 10 9 


6 




2.7 x 10 10 


5.7 x 10 9 


5 




s 1 x 10 8 


s I x 10 8 


1 



° Values reported are the mean values obtained from three DNase 1 
footprint titration experiments. b The assays were carried out at 22 °C at 
pH 7.0 in the presence of 10 mM Tris-HCl, 10 mM KG, 10 mM MgCl 2 , 
and 5 mM CaCK. c Match site association constants and specificities 
higher than the parent hairpin arc shown in boldtype. ^Specificity is 
calculated as k a (match) / K a (mismatch). 

The 6-Y-6 hairpin ImPylmPyPyPy-y-ImPyPyPyPyPy-P-Dp, which contains six 
consecutive amino acid pairings, is unable to discriminate a single-base-pair mismatch site 5'- 
TGTTAACA-3' from a 5 * -TGTG AAC A- 3 ' match site. The hairpin polyamide Im-P- 
ImPyPyPy-y-ImPyPyPy-P-Py-p-Dp binds to the 8-bp match sequence 5'-TGTGAACA-3 7 with 
an equilibrium association constant of K a = 2.4 x 10 10 M" 1 and > 48-fold specificity versus the 
5'-TGTTAACA-3' single-base-pair mismatch site. 
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Table 26 Equilibrium association constants ( M" h for polyamides.°' c 



Polyamide 


5TGTTAACA-3' 


5 -TGTGAACA-3* 


Specificity 4 


♦OOOOOn 


2.5 x 10 9 


3.9 x 10* 


6 


•-OOOOOv 


6.6 x 10 9 


2.5 x 10 8 


26 


•o#oock 


5x 10 9 


5x 10 9 


1 




s5x 10 a 


£.4 x 10 10 


*48 



a Values reported for 1. 5. and 10 are the mean values obtained from 
three DNase 1 footprint titration experiments. 6 The assays were carried 
out at 22 °C at pH 7.0 in the presence of 10 raM Tris-HCl, 10 mM KC1, 
10 mM MgCi 2 t and 5 mM CaC^. c Match sice association constants 
and specificities higher than parent hairpins are shown in 
boldtype. d Specificity is calculated as K^match") / K a (mismatch). 

Modeling indicates that the p-alanine residue relaxes ligand curvature, providing for 
optimal hydrogen bond formation between the floor of the minor groove and both Im-residues 
5 within the Im-P-Im polyamide subunit. This observation provided the basis for design of a 
hairpin polyamide, Im-P-ImPy-y-Im-P-ImPy-P-Dp, which incorporates Im/p pairings to 
recognize a "problematic" S'-GCGC-S' sequence at subnanomolar concentrations. 

Table 27 Equilibrium association constamts (MT for poly amides. a ' b 



Polyamide 5'-TGCGCA-3' 5V-TGGCCA-3' S'-TGGGGA-B' 





3.7 x 10 7 


< 10 7 


< 10 7 




3.7 x I0 9 


1.4 x 10 8 


1.1 x 10* 



a Values reported are the mean values obuained from a minimum of three 
DNase I footprint titration experiments. ^The assays were carried out at 
22 °C at pH 7.0 in the presence of 10 mMI Tris-HCl," 10 mM KCK 10 mM 
MgCl 2 , and 5 mM CaCl 2 . 



10 These results identify Im/p and p/Im pairings that respectively discriminate G»C and 

OG from A # T/T«A as well as Py/p and p/Py pairings that discriminate A # T/T»A from 
G # C/C*G. These aliphatic/aromatic amino acid pairings will facilitate the design of hairpin 
polyamides which recognize both a larger binding site size as well as a more diverse sequence 
repertoire. 

15 

EXAMPLE 10: 
POLYAMIDE BIOTIN CONJUGATES 

Bifiinctional conjugates prepared between sequence specific DNA binding polyamides 
and biotin are useful for a variety of applications. First, such compounds can be readily attached 
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to a variety of matrices through the strong interaction of biotin with the protein streptavidin. 
Readily available strepdavidin-derivatized matrices include magnetic beads for separations as 
well as resins for chromatography. 

A number of such polyamide-biotin conjugates have been synthesized by solid phase 
synthetic methods outlined in detail above. Following resin cleavage with a variety of diamines, 
the polyamides were reacted with various biotin carboxylic acid derivatives to yield 
bifunctional conjugates. The bifunctional conjugates were purified by HPLC and characterized 
by MALDI-TOF mass spectroscopy and ! H NMR. 

The scheme for the synthesis of an exemplary biotin- polyamide conjugate is shown 

below. 




o 




The foregoing is intended to be illustrative of the present invention, but not limiting. 
Numerous variations and modifications of the present invention may be effected without 
departing from the true spirit and scope of the invention. 
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What is claimed is: 

1 . In a polyamide having at least three consecutive carboxamide pairs for 
binding to at least three DNA base pairs in the minor groove of a duplex 
DNA sequence having at least one A«T or T^A DNA base pair, the 
improvement comprising selecting a Hp/Py carboxamide pair to 
correspond to a T^A base pair in the minor groove of the duplex DNA 
sequence or selecting a Py/Hp carboxamide pair to bind to an A»T DNA 
base pair in the minor groove of the duplex DNA sequence. 

2. The polyamide of claim 1 wherein at least four consecutive carboxamide 
pairs bind to at least four DNA base pairs. 

3. The polyamide of claim 1 wherein at least five consecutive carboxamide 
pairs bind to at least five DNA base pairs. 

4. The polyamide of claim 1 wherein at least six consecutive carboxamide 
pairs bind to at least six DNA base pairs. 

5. The polyamide of claim 1 wherein the A»T or T»A base pair has a G»C 
or OG base pair on either side. 

6. The polyamide of claim 1 wherein the duplex DNA sequence is a 
regulatory sequence. 

7. The polyamide of claim 1 wherein the duplex DNA sequence is a 
promoter sequence. 

8. The polyamide of claim 1 wherein the duplex DNA sequence is a coding 
sequence, 

9. The polyamide of claim 1 wherein the duplex DNA sequence is a non- 
coding sequence. 

10. The polyamide of claim 1 wherein the binding of the carboxamide pairs 
to the DNA base pairs modulates the expression of a gene. 
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11. A composition comprising an effective amount of the polyamide of claim 
1 and a pharmologically suitable excipient. 

12. A diagnostic kit comprising the polyamide of claim 1 . 

13. A polyamide according to claim 1 having the formula: 

X1X2X3X4-Y-X5X6X7X8 
wherein 7 is -NH-CH2-CH2-CH2-CONH- hairpin linkage derived from 
y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X4/X5, X3/X6, X2/X7, and Xj/Xs represent 
carboxamide binding pairs which bind the DNA base pairs wherein at 
least one binding pair is Hp/Py or Py/Hp and the other binding pairs are 
selected from Py/Im Im/Py to correspond to the DNA base pair in the 
minor groove to be bound. 

14. The polyamide of claim 13 wherein there is at least one (3-alanine in a 
non- Hp containing binding pair. 

15. The polyamide of claim 13 wherein dimethylaminopropylamide is 
covalently bound to Xi or X8- 

16. A polyamide selected from those listed in Tables 9-24 as 
compounds 1 through 240. 

17. A polyamide selected from shown in Fig. 4. 
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** Hp i m 



o 

7 --si ° • 

1 ImJmPyPy-Y-liiiPyPyPy^-Sp ° 



o 

mi 

9 i 



2 ImlmPyPy-Y-ImHpP>P>-3-Dp ° 



o 



c 

I o 
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3 ImlmHpPy-Y-ImPyPyPy-p-Dp 
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5-T G O 



C A-3' 



3-A C C [aJG T-S 

Py/Py with T»A 




5-T G G A C A-3' 



3'-A C C 



G T-5* 



Py/Py with A»T 



5'-T G G 



C A-3* 



+ ><KX>L I 

3-A C C gy G T-5' 
Py/Hp with T«A 




5*-T G G S] C A-3' 

3'-A C C [rj G T-5* 
Py/Hp with A*T 



5*-T G G ff] C A-3' 

t^>ooc-r 

3-A C C [AJ G T-5* 

Hp/Py with T»A 



5 -T G G 
3 -A C C 



C A-3' 



G T-5' 



Hp/Py with A*T 



FIG. 2 



WO 98/37066 



PCT/US98/01006 



3/17 




FIG. 3 
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5'-T G T El A-3' 

3-A C AffiJT-S' 

K d = 0.20 \iM 



6-Ring Hairpin Hp-Py-Ira-Polyaroides 

5 -T G I*lT T-3 

3-A C By A A-5' 

K 6 = 0.28 [iM K 6 = 0.008 *iM 



5'-T GTS A-3' 
3'-A C A * T-5' 



5*-T G (Ai T T-3' 

3'-A C IXJ A A-5' 

AT d =0.33 \xM 



8-Ring Hairpin Hp-Py-Im-Polyamides 



5 -T G G [T C A-3' 

3'-A C C [AjG T-5' 

K d = 0.48 nM 



5 -T G G 
3 -A C C 



A C A-3' 



5-T 



TO T-5' 



K H =0.83 nM 




A-3' 



T-5* 



10-Ring Hairpin Hp-Py-Im-Polyamides 

5-T GSlAfAlc A-3' 5*-T G G 

3-A C [aJT [TJG T-5' 

K 6 = 0.2 nM 



3 -A C A 




A- d = 5oM 
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1M NaOH / EtOH 




White Solid 



FIG. 5 
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lmlmP>Py-vlmOpPyPy-P-Dp 




(2) ImlmPyPy-Y-ImHpPyPy-p-Dp 
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FIG. 10 
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l»J»*T*>-rl«HpPyPy.fWOi*KXITA«Pc<II) 1t0«M 



>2 



$-C TCATTOCACAaCQACCTCATACCTTGaCGTAATCATaCTCAT-3 
)-C ACTAACCTSTCGCTCCACTATCCAACCCCATTACTACCACTA-i 

i in 




telaIM>T->^> > r^-0-D|>-KDTA>Fc<n> IM«M 



■ < 

■ 
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ImPyPyPyPy-Y-lmPyPyPyPy-^Dp lmHpPyPyPj-T-imHpPyPyPy-fr-Dp 
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abed 
5' -w o t n w 

(DffHBSr 

3 * -W C A Jl M W-5» 



dt 



d-A 



d-« 



d-C 



5*-W O T T T W-3' 



5'-W O T T A W-3' 5-W OTTO W-3* 



3*-W O T T C W-3' 




3'-W C A A T W-5' 
i a i * » 4 t • 

2 ) Ima P B P IT-T-BpPT*7Py 



3-W C A A C 

1 1 I « 1*19 

3 ) iaBpBpI»-7-PyryPyP7 



3 '-If C A A O 

I S 1 4 * * 7 • 

4 ) xaflpapry-7-x«pyryPY 



5'-W 



A W-3* 



S'-W 



3-W C A T A W-5' 

l 2 J 4 S 4 7 • 



3-W C A T T W-5' 




5-W O T A C W-3' 



3-W CATC W-5' 

* . . , 1J14 1 4 7 t I 1 J 4 I * 7 • liJt *4tt 

S)lBHpPyBp-r-PyapPyPy 6 ) l*HpPyPy-Y-apBpryPy 7 ) imHpPy !»-7-PyBpPyPy 



3-W C A T O W-5 

12 14 * 4 7 t 

8) IaflpPyPy-r- 1 aBpPyPy 



5-W Q T O T W-3' 



9 ) imHpImflp-Y-PyPyPyPy 



5-W 




5-W O T O A W-3' 



3-W C A C T W-5* 

12 14 14 7» 

l o ) i»Bpl»tf>y-Y-apPyPyPy 



5-W O T O O W-3' 



3-W C A C C W-5' 

13 5 4 J 4 7 • 

11 >i9iHpimi»-Y-PypyPyPy 



5*-W O T O C W-3' 



3-W C AC O W-5" 
i l > « s 4 l • 
12 J iaflpIaPy-T- i^pyPyPy 



W-3' 



3-W 



1114 4 4 7 4 

l 3 > ifiHpPyHp-y-pyliiPyPy 




5 -W O 



O W-3' 




14 jlmHppVPy-T-BpI^yPy is > laflppy Ia-T-Py I*PyPy 



C W-3' 



3*-W C A O O W-5' 

1 3 1 4 14 7 4 

16 ) ImflpPyPy-7-l4vI«PyPy 



FIG. 14 



WO 98/37066 



PCT/US98/01006 



15/17 




3 1 -W C T Jf II W- 5 ' 



d-T 



d-A 



d-C 



5-W OAT* W-3' 



3-W C T A A W-3' 
13)4 1 c f • 



5-W Q A * A W-3' ! S-W Q A T O W-3* 



5*-W a A T C W-3' 



I 3 -M C X A T W-5* | 3-W C T A C W-S' 

11)* » 4 1 • ! 1214 J * 7 t 

IB ) iaPyHpPy-T-BpPyBpPy 1 19 ) l4ll>ySpl44-Y-PyPyBpPY 



3-W C T A O W-5' 

1 2 J 4 1 4 1 • 

20 ) l*PyBppy-T-tal»yHpPy 



S'-W 



3-W 




5-W 



T T A K-5 

1 I 1 4 14*4 

2 1 > InPyPyHp-T-PyHpBpPy 



5'-W 




W-3" 



5-W 




W-3' 



3-W C T T T W-5* 3'-* C T T C W-5' 3-„ „ * * ~ 

1114 14 14 1114 lilt 1234 4414 

22 >I*f>yPyPy-T-BpHpBpPy 1 23 ) I«PyP-yI«-y.pyBpapPy 24 UnPyPyPy-y- I*8p€pPy 



c-c 



5-W O A C T W-3' 



3-W C T C A W-3' 

12 14 14 14 

2S >I«PyIi»Hp-7-PyPyHpPy 



5-W O A C T W-3* 



5"-W GAGA W-3* 



5*-W O A Q Q W-3' 



3-W C t C T W-5* 3-W C T C C ¥-5' 

1224 4 4 T • 12)4 1 4 T 4 

26 > ImPylaPy-r-BpPyapPy t 27 ) ImPy I »J»-t- PyPyBpPy 



5-W O A C A W-3' 



I 



5-W O A C O W 



3 -W C T 0 A W-3' 3-W C I O T W-3* 3 -W C T 0 C W-3' 3 -W C f G G W-3* 

121* 14 7 4 1214 1414 J J 1 4 1 4 T 4 1214 4471 

2»)I»PyPyBp-T-Py raHpPy 30 ) I*PyPyPy-T-BpI»BpPy ■ 3 u IaPyPy Ia-T-Py laflpPy 32 ) I«PyPyPy-T-l4Um8pPy 




5-W G A 0 C W-3* 



3-W C T C G W-3' 

1 2 J 4 9 4 7 4 

28 > IftPylnj»y-y- 1 nPyBpPy 

S'-W G A C C W-3' 



FIG.15 



BNSOOCtO. «WO_9837066MJ_> 



WO 98/37066 



PCT/US98/01006 



16/17 



• bed 



3« -W C C H If W- 5* 



d-T 



d-A 



d-C 



5*-W GOT? W-3' 5*-W O O T A W-3' 5-W GOTO W-3* 3'-W O G T C W-3 



3'- 



3-W C C A A W-3' 

1 2 S « 1 4 1 • 

33 > i«i»Bpap-T-pypy*ypy 



3-W CCA? W-5' 

l J J 4 14 7 4 

34 ) xniBBppyY-BpPyPyPy 



W C C A C W-5' 

1114 1 < 7 « 

35 ) I*IafipX*-<r-PyPyPyPy 



3-W C C A O W-S' 

12 2 4 I 4 T • 

36 ) lnlaapPy-Y*l»PyPyI»y 



C-A 



C-O 



C-C 




3-W C C T A W-3' 

11)4 S 4 1 I 

37 ) i«j»i»yap-7*PyBpPyi , y 



5-W 



3-W C C T T W-3' 

1114 * 4 1 • 

38 >I»2»PyPy-7-BpepPyPy 




5 -W G Q A O W-3 




3'-W 



C W-3' 



3 -W c"? T* C* W-5' 3'-W C C T O W-5* 

1 I J 4 I 4 T • 1234 » 4 7 • 

39 ) InlnPyXa-r-PyBpPyPy 40 ) I»ImPyPy-7- I«flpPyPy 



5-K O G G T W-3' 



3-W C C C A W-5' 



5'-W O G O A W-3' 



3 -W C C C T W-5' 

12)4 14 7 4 



3 -W 0 O O 0 W-3* 5-W O O G C W-3* 



3-W C C C C W-5' 



1214 » 4 7 • ^ 

4 \ i iaiimiBBp-Y-PyPyPyPy 42 ) iai»i»Fy-Y-HpPyPyPy 



3-W C C C O W-5' 

1114 I 4 7 t 1214 J 4 7 • 

Gl )I*I»I»I»-T-Pypypypy C2 >i»imi»Py-Y-iaPypyPy 



S -W O O C T W-3' 



3 -W C C G A W-5' 

12 14 5 111 

43 )iai«PyBp-T-Pyt«Pypy 



5'-W 



3'-W C C G T W-3' 

1214 5 4 T 4 

44 >I»2»PyPy-T-BpI»PyPy 



S'-W G O C O W-3' 



5'-M O O C C W-3' 



3-W C C O C W-5' 



3-W C C G O W-3' 



1214 14ft 1224 SfTt 

G3 ) I»lmPyt»-7-PYl»l^yPy ) ImlmPyPy-T- ImlaPyPy 



FIG. 16 



BNSDOCID- <WO 983706^ 1 _L> 



WO 98/37066 




PCT/US98/01006 



17/17 



• bed 

O C W V V-3' 



3»-w c a w m w-5' 



d-T 



5*-W a C » T W-3' 



5'-W O C T A W-3' ! S-W O C T O W-3' 



3-W C O A A W-5' I 3-W COAT W-5' 



5'-K O C T C W-3' 



3-W C O A C W-5' 



1114 i • ? • 

45 )i«pyBpflf>-T-rf^rx*PTr 



1 J 3 4 1 < 7 • 1 1 J ♦ 1*71 

46 ) InPyBpFy-T-BppyXaPy 4 7 j ImPyBpIm-Y-PyPylmPy 



3 -W C O A <3 W-5* 

12 14 1 < T • 

48 ) XaPyappy-Y-i»PyiaPy 



C-A 



5-W O C A * W-3 



3'- 



5*-W O C A A W-3' 



5 -W O C A O W-3' 



W C O T A W-5' 

114 % 4 1 • 



4 9 )I*Pyl*yBp-T-P7*pI»P7 I 



3 -W C 0 T T W-5* 



1114 t f 7 • _ 

50 ) XaPyPyPy-T-BpSptBPy ; 51 ) laPyFyXB-r-PyBpXaPy 



3 -W COTC W-5' 

1114 $474 



5 -W a C A C W-3' 



3 -W C O X O W-5' 

1114 J 4 7 i 

52 ) iMpyPypy-T-XafiplftFy 



5-W O C 0 T W-3' 



3-W COCA W-5' 

1114 $474 

53 ) XMPyl4rip-T-PyPyX*Py 



5 -W O C O A W-3* 



3 -W C O C T W-5' 



5-W O C o Q W-3' 



5-W O C O C W-3' 



3-W C O C C W-5* 

1114 5 * 7 i 1114 S414 

i 54 ) lmPyI*Py-7-appyIaPy • G5 ) IaPy I »I»-7- PyPy I»Py G6 ) ImPylaPy-y- IaPylmPy 



3 -W C O C 0 W-5' 

1114 14 7 4 



5-W a C C * W-3' 



3 -W O C C A W-3' 



3'-W C O O A W-S* 

1114 1474 

55 )XnPyPyBp-r-PyI*ImPy 



5 -W O C C O W-3' 5 -W O C C C W-3' 



I 3 -W COOT W-5' I 3 -W C O O C W-5* 

' 1114 447t! 1134 I « 7 • 

j 56 )I*PyPyP7-7-HpImI»Py \ C7 ) XaPyPyI»-Y-Pyl4UaPy 



5 -W C O O O W-3' 

1 1 S 4 S 4 7 • 

G8 > XaPyPypy-?-iaiAlaPy 



FIG. 17 



BNSDOCID". <WO 9837066A1 I > 



INTERNA 



JAL SEARCH REPORT 



InU T^J^Ap plication No 

PCT/US 98/01006 



A. CLASSIFICATION OF SUBJECT MATTER , , , 

IPC 6 C07D207/34 C07D233/90 A61K31/415 C07D403/14 C1201/68 



According to International Patent Classification(iPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07D A61K CUQ 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 1 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



J. W. TRAUGER ET AL: "Recognition of DNA 
by designed ligands at subnanomolar 
concentrations" 
NATURE, 

vol. 382, no. 6591, 8 August 1996, 
pages 559-561, XP002066256 
cited in the application 
see the whole document 

S.E. SWALLEY ET AL: "Recognition of a 
5'-(A,T)GGG(A,T)2-3' sequence in the minor 
groove of DNA by an eight-ring hairpin 
polyamide" 

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY , 
vol. 118, no. 35, 4 September 1996, 
pages 8198-8206, XP002066377 
see page 8198 - page 8202 

-/-- 



13-15,17 



13-15,17 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



° Special categories of cited documents : 

"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority ciaim(s) or 
which is cited to establish the publicationdate of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



"T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand She principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken atone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"&" document member of the same patent family 



Date of the actual completion of theinternational search 



28 May 1998 



Date of mailing of the international search report 



1 2. Oa 98 



Name and matting address of the ISA 

European Patent Office. P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Voyiazoglou, D 



Form PCT/lSA/210 (second sheet) (July 1992) 



page 1 of 2 



INTERN A) 



AL SEARCH REPORT 



ional Application No 

PCT/US 98/01006 



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



A 

P,X 



P.X 



E . B. BAIRD ET AL: "Solid phase synthesis 
of polyamides containing imidazole and 
pyrrole amino acids" 

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 
vol. 118, no. 26, July 1996, 
pages 6141-6146, XP000674666 
cited in the application 
see page 6141 - page 6142 

M. E. PARKS ET AL: "Optimization of the 

hairpin polyamide design for recognition 

of the minor groove of DNA" 

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 

vol. 118, no. 26, July 1996, 

pages 6147-6152, XP000674668 

see page 6147 - page 6148 

M. E. PARKS ET AL : "Recognition of 
5'-(A,T)GG(AT)2-3' sequences in the minor 
groove of DNA by hairpin polyamides" 
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 
vol. 118, no. 26, July 1996, DC US, 
pages 6153-6159, XP000674667 
see page 6153 - page 6155 

WO 96 05196 A (PHARMACIA) 22 February 1996 
see claim 1 

S. E. SWALLEY ET AL : "Discrimination of 
5'-GGGG-3', and 5'-GGCC-3' sequences in 
the minor groove of DNA by eight-ring 
hairpin polyamides" 

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 
vol. 119, no. 30, 30 July 1997, DC US, 
pages 6953-5961, XP002066260 
see page 6959 - page 6961 

W. L. WALKER ET AL: "Estimation of the 
DNA sequence discriminatory ability of 
hairpin-linked lexitropsins" 
PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES, U.S. A. , 
vol. 94, no. 11, May 1997, 
pages 5634-5639, XP002066261 
see table 1 



13-15,17 



13-15,17 



13-15,17 



13-15,17 
13-15,17 



13-15,17 



Form PCT/1SA/210 (continuation ot secona sheet) (July 1992) 
■ <WO 9837066A1 J_> 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 



InU^^Hbat application No. 

~T/US 98/01006 



Box I Observations where certain laims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. | Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



Pn Claims Nos.: 1-12,16 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

The claims are so broad that for determining the scope of a neaningful 
international search due account has been taken of rule 33.3 PCT; special 
emphasis was put on the following subject-matter: claims 13-15,17; pages 1-34; 
examples 1-8 and figures 1-11 



□ 



Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box I! Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I I As alt required additional search fees were timely paid by the applicant, this International Search Report cavers all 
' ' searchable claims. 

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. As only some of the required additional search fees were timely paid by the applicant, this International Search Report 

' ' covers only those claims for which fees were paid, specifically claims Nos. : 



4 - I l No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| j No protest accompanied the payment of additional search fees. 

Form PCT/ISA/210 (continuation of first sheet (1)) (July 1992) 



BNSOOCID: <WO . 9837066A1 I _> 



INTERN A^BNAL SEARCH REPORT 

Information on patent family members 



ional Appltcation No 

PCT/US 98/01006 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



AU 


689623 


B 


02-04-1998 


AU 


3113695 


A 


07-03-1996 


CA 


2172629 


A 


22-02-1996 


CN 


1131946 


A 


25-09-1996 


EP 


0722446 


A 


24-07-1996 


FI 


961506 


A 


05-06-1996 


HU 


76267 


A 


28-07-1997 


JP 


9504039 


T 


22-04-1997 


NO 


961377 


A 


30-05-1996 


NZ 


290404 


A 


24-04-1997 


PL 


313821 


A 


22-07-1996 


ZA 


9506590 


A 


18-03-1996 



WO 9605196 



22-02-1996 



Form PCT/ISA/210 (patent family annex) (July 1992) 
BNSOOCID: <WO_9837066A 1 _l_> 



CORRECTED 
VERSION* 



PCT 



# 



INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C07D 207/34, 233/90, A61K 31/415, 
C07D 403/14, C12Q 1/68 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 98/37066 

27 August 1998 (27.08.98) 



(21) International Application Number: PCT/US98/0 1 006 

(22) International Filing Date: 21 January 1998 (21.01.98) 



(30) Priority Data: 

PCT7US97/03332 
(34) Countries for 

international 
60/043,444 
60/042,022 
08/837,524 
08/853,522 
PCT/US97/ 12722 
(34) Countries for 

international 



20 February 1997 (20.02.97) WO 
which the regional or 

application was filed: US et al. 

8 April 1997 (08.04.97) US 

16 April 1997 (16.04.97) US 

21 April 1997 (21.04.97) US 
8 May 1997 (08.05.97) US 
21 July 1997 (21.07.97) WO 

which the regional or 

application was filed: US et al. 



(63) Related by Continuation (CON) or Continuation-in-Part 
(CIP) to Earlier Applications 

US 08/853,522 (CIP) 

Filed on 8 May 1997 (08.05.97) 

US 08/837,524 (CIP) 

Filed on 21 April 1997 (21.04.97) 

US 08/607,078 (CIP) 

Filed on 26 February 1996 (26.02.96) 

US 60/042,022 (CIP) 

Filed on 16 April 1997 (16.04.97) 



US 

Filed on 



60/043,444 (CIP) 
8 April 1997 (08.04.97) 



(71) Applicant (for all designated States except US): CALIFORNIA 

INSTITUTE OF TECHNOLOGY [US/US]; Pasadena, CA 
91125 (US), 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): BAIRD, Eldon, E. 
[US/US1; 255 S. Madison Avenue #5, Pasadena, CA 91 101 
(US). DERVAN, Peter, B. [US/US]; 1235 St. Albans Road, 
San Marino, CA 91 108 (US). 

(74) Agent: McDONNELL, John, J.; McDonnell Boehnen Hulbert 
& Berghoff, 300 South Wacker Drive, Chigago, IL 60606 
(US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, GW, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, UA, UG T US, UZ, VN, YU, ZW, ARIPO patent 
(GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian patent 
(AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent 
(AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 



(54) Title: IMPROVED POLYAMIDES FOR BINDING IN THE MINOR GROOVE OF DOUBLE STRANDED DNA 
(57) Abstract 

The invention encompasses improved polyamides for binding to specific nucleotide sequences in the minor groove of double 
stranded DNA. The 3-hydroxy-N-methylpyrrole/N-methylpyrrole carboxamide pair specifically recognizes the T.A base pair, while the 
N-methylpyrrole/3-hydroxy-N-methylpyrrole pair recognizes A.T nucleotide pairs. Similarly, an N-methylimidizole/N-methylpyrrole 
carboxamide pair specifically recognizes the G.C nucleotide pair, and the N-methylpyrroIe/N-methylimidizole carboxamide pair recognizes 
the C.G nucleotide pair. 



♦(Referred to in PCT Gazette No. 05/1999, Section II) 

BNSOOClp- <WO 9837066A1 JA> 



• 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


Z\V 


Zimbabwe 


CI 


Ctite d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SO 


Singapore 







BNSDOCID: <WO_9837066A1_IA> 



WO 98/37066 



PCT/US98/01006 



IMPROVED POLY AMIDES FOR BINDING IN THE MINOR 
GROOVE OF DOUBLE STRANDED DNA 

5 The U.S. Government has certain rights in this invention pursuant to Grant Nos. GM 

26453, 27681 and 47530 awarded by the National Institute of Health. 

CROSS REFERENCE TO RELATED APPLICATIONS 

10 This application is a continuation-in-part of PCT/US97/03332 filed February 20, 1997, 

Serial No. 08/853,522 filed May 8, 1997 and PCT/US 97/12722 filed July 21, 1997 which are 
continuation-in-part applications of Serial No. 08/837,524, filed April 21, 1997, Serial No. 
08/607,078, filed February 26, 1996, provisional application Serial No. 60/042,022, filed April 
16, 1997 and provisional application Serial No. 60/043,444, filed April 8, 1997. 

15 

BACKGROUND OF THE INVENTION 

Field of the Invention 

20 This invention relates to polyamides which bind to predetermined sequences in the 

minor groove of double stranded DNA. 

Description of the Related Art 

25 The design of synthetic ligands that read the information stored in the DNA double helix 

has been a long standing goal of chemistry. Cell-permeable small molecules which target 
predetermined DNA sequences are useful for the regulation of gene-expression. 
Oligodeoxynucleotides that recognize the major groove of double-helical DNA via triple-helix 
formation bind to a broad range of sequences with high affinity and specificity. Although 

30 oligonucleotides and their analogs have been shown to interfere with gene expression, the triple 
helix approach is limited to purine tracks and suffers from poor cellular uptake. The 
development of pairing rules for minor groove binding polyamides derived from N- 
methylpyrrole (Py) and N-methylimidazole (Im) amino acids provides another code to control 
sequence specificity. An Im/Py pair distinguishes G*C from OG and both of these from A*T 

35 or T*A base pairs. Wade, W.S., Mrksich, M. & Dervan, P.B. describes the design of peptides 
that bind in the minor groove of DNA at 5 , -(A,T)G(A J T)C(A,T)-3' sequences by a dimeric 
side-by-side motif. J. Am, Chem. Soc. 114, 8783-8794 (1992); Mrksich, M. et al describes 
antiparallel side-by-side motif for sequence specific-recognition in the minor groove of DNA by 
the designed peptide l-methylimidazole-2-carboxamidenetropsin. Proc. Natl. Acad. Sci. USA 

40 89, 7586-7590 (1992); Trauger, J.W., Baird, E. E. Dervan, P.B. describes the recognition of 
DNA by designed ligands at subnanomolar concentrations. Nature 382, 559-561 (1996). A 
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Py/Py pair specifies A«T from G»C but does not distinguish A»T from T»A, Pelton, J.G. & 
Wemmer, D.E. describes the structural characterization of a 2-1 distamycin A- 
d(CGCAAATTTGGC) complex by two-dimensional NMR. Proc. Natl. Acad. ScL USA 86, 
5723-5727 (1989); White, S., Baird, E. E. & Dervan, P.B. Describes the effects of the A«T/T«A 

5 degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA. 
Biochemistry 35, 6147-6152 (1996); White, S., Baird, E. E. & Dervan, P. B. describes the 
pairing rules for recognition in the minor groove of DNA by pyrrole-imidazole polyamides. 
Chem. & Biol. 4, 569-578 (1997); White, S., Baird, E. E. & Dervan, P.B. describes the 5'-3' N- 
C orientation preference for polyamide binding in the minor groove. In order to break this 

10 degeneracy, a new aromatic amino acid, 3-hydroxy-N-methylpyrroIe (Hp) incorporated into a 
polyamide and paired opposite Py, has been found to discriminate A»T from T#A. The 
replacement of a single hydrogen atom on the pyrrole with a hydroxy group in a Hp/Py pair 
regulates affinity and specificity of a polyamide by an order of magnitude. Utilizing Hp 
together with Py and Im in polyamides to form four aromatic amino acid pairs (Im/Py, Py/Im, 

15 Hp/Py, and Py/Hp) provides a code to distinguish all four Watson-Crick base pairs in the minor 
groove of DNA. 

SUMMARY OF THE INVENTION 

20 The invention encompasses improved polyamides for binding to the minor groove of 

double stranded ("duplex") DNA. The polyamides are in the form of a hairpin comprising two 
groups of at least three consecutive carboxamide residues, the two groups covalently linked by 
an aliphatic amino acid residue, preferably y-aminobutyric acid or 2,4 diaminobutyric acid, the 
consecutive carboxamide residues of the first group pairing in an antiparallel manner with the 

25 consecutive carboxamide residues of the second group in the minor groove of double stranded 
DNA. The improvement relates to the inclusion of a binding pair of Hp/Py carboxamides in the 
polyamide to bind to a T«A base pair in the minor groove of double stranded DNA or Py/Hp 
carboxamide binding pair in the polyamide to bind to an A«T base pair in the minor groove of 
double stranded DNA. The improved polyamides have at least three consecutive carboxamide 

30 pairs for binding to at least three DNA base pairs in the minor groove of a duplex DNA 
sequence that has at least one A«T or T»A DNA base pair, the improvement comprising 
selecting a Hp/Py carboxamide pair to correspond to a T»A base pair in the minor groove or a 
Py/Hp carboxamide pair to bind to an A«T DNA base pair in the minor groove. Preferably the 
binding of the carboxamide pairs to the DNA base pairs modulates the expression of a gene. 

35 

In one preferred embodiment, the polyamide includes at least four consecutive 
carboxamide pairs for binding to at least four base pairs in a duplex DNA sequence. In another 
preferred embodiment, the polyamide includes at least five consecutive carboxamide pairs for 
binding to at least five base pairs in a duplex DNA sequence. In yet another preferred 
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embodiment, the polyamide includes at least six consecutive carboxamide pairs for binding to at 
least six base pairs in a duplex DNA sequence. In one preferred embodiment, the improved 
polyamides have four carboxamide binding pairs that will distinguish A«T, T*A, OG and G«C 
base pairs in the minor groove of a duplex DNA sequence. The duplex DNA sequence can be a 
regulatory sequence, such as a promoter sequence or an enhancer sequence, or a gene sequence, 
such as a coding sequence or a non-coding sequence. Preferably, the duplex DNA sequence is a 
promoter sequence. 

The preparation and the use of polyamides for binding in the minor groove of double 
stranded DNA are extensively described in the art. This invention is an improvement of the 
existing technology that uses 3-hydroxy-N-methylpyrrole to provide carboxamide binding pairs 
for DNA binding polyamides. 

The invention encompasses polyamides having y-aminobutyric acid or a substituted y- 
aminobutyric acid to form a hairpin with a member of each carboxamide pairing on each side of 
it. Preferably the substituted y-aminobutyric acid is a chiral substituted y-aminobutyric acid 
such as (R)-2,4-diaminobutyric acid. In addition, the polyamides may contain an aliphatic 
amino acid residue, preferably a p-alanine residue, in place of a non-Hp carboxamide. The P- 
alanine residue is represented in formulas as p. The p-alanine residue becomes a member of a 
carboxamide binding pair. The invention further includes the substitution as a P«p binding pair 
for non-Hp containing binding pair. Thus, binding pairs in addition to the Hp/Py and Py/Hp are 
Im/p, p/Im, Py/p, p/Py, and p/p. 

The polyamides of the invention can have additional moieties attached covalently to the 
polyamide. Preferably the additional moieties are attached as substituents at the amino terminus 
of the polyamide, the carboxy terminus of the polyamide, or at a chiral (R)-2,4-diaminobutyric 
acid residue. Suitable additional moieties include a detectable labeling group such as a dye, 
biotin or a hapten. Other suitable additional moieties are DNA reactive moieties that provide 
for sequence specific cleavage of the duplex DNA. 

Brief Description of the Drawings 

Figure 1 illustrates the structure of polyamide 2*_and 3. 

Figure 2 illustrates the pairing of polyamides to DNA base pairs. 

Figure 3 illustrates the DNase footprint titration of compounds 2 and 3. 

Figure 4 illustrates a list of the structures of representative Hp containing polyamides. 

Figure 5 illustrates the synthesis of a protected Hp monomer for solid phase synthesis. 

Figure 6 illustrates the solid phase synthesis of polyamide 2. 

Figure 7 illustrates the 1H-NMR characterization of polyamide 2. 
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Figure 8 illustrates the Mass spectral characterization of polyamide 2. 

Figure.9 illustrates 1H-NMR characterization of synthesis purity. 

Figure 10 illustrates DNasel footprint titration experiment. 

Figure 1 1 illustrates the synthesis of Afunctional conjugate of polyamide 2. 
5 Figure 12 illustrates affinity cleaving evidence for oriented hairpin formation. 

Figure 13 illustrates increased sequence specificity of Hp/Py containing polyamides. 

Figure 14 illustrates 8-ring hairpin polyamides which target S'-WGTNNW^' sites. 

Figure 15 illustrates 8-ring hairpin polyamides which target 5'-WGANNW-3* sites. 

Figure 16 illustrates 8-ring hairpin polyamides which target 5'-WGGNNW-3* sites. 
10 Figure 17 illustrates 8-ring hairpin polyamides which target 5'-WGCNNW-3' sites. 

DETAILED DESCRIPTION OF THE INVENTION 

Within this application, unless otherwise stated, definitions of the terms and illustration 
15 of the techniques of this application may be found in any of several well-known references such 
as: Sambrook, J., et al., Molecular Cloning: A Laboratory* Manual, Cold Spring Harbor 
Laboratory Press (1989); Goeddel, D., ed., Gene Expression Technology, Methods in 
Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in 
Deutshcer, M.P., ed., Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, et 
20 al., PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, CA 
(1990); Freshney, R.I., Culture of Animal Cells: A Manual of Basic Technique, 2 nd Ed., Alan 
Liss, Inc. New York, NY (1987); Murray, E.J., ed., Gene Transfer and Expression Protocols, 
pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI, Oxford University 
Press, New York (1997). 

25 

For the purposes of this application, a promoter is a regulatory sequence of DNA that is 
involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a 
segment of DNA involved in producing a peptide, polypeptide or protein, including the coding 
region, non-coding regions preceding ("leader") and following ("trailer") the coding region, as 

30 well as intervening non-coding sequences ("introns") between individual coding segments 
("exons"). Coding refers to the representation of amino acids, start and stop signals in a three 
base "triplet" code. Promoters are often upstream (" '5 to") the transcription initiation site of 
the corresponding gene. Other regulatory sequences of DNA in addition to promoters are 
known, including sequences involved with the binding of transcription factors, including 

35 response elements that are the DNA sequences bound by inducible factors. Enhancers comprise 

4 
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yet another group of regulatory sequences of DNA that can increase the utilization of 
promoters, and can function in either orientation (5*-3' or 3'-5') and in any location (upstream 
or downstream) relative to the promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an endogeneous ligand (e.g. a transcription factor) to the regulatory 
5 sequence increases transcription, thereby resulting in increased expression of the corresponding 
target gene. In such a case, interference with transcription by binding a polyamide to a 
regulatory sequence would reduce or abolish expression of a gene. 

The promoter may also include or be adjacent to a regulatory sequence known in the art 
10 as a silencer. A silencer sequence generally has a negative regulatory effect on expression of 
the gene. In such a case, expression of a gene may be increased directly by using a polyamide 
to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a 
polyamide to block transcription of a factor to a silencer regulatory sequence. 

15 It is to be understood that the polyamides of this invention bind to double stranded DNA 

in a sequence specific manner. The function of a segment of DNA of a given sequence, such as 
5'-TATAAA-3\ depends on its position relative to other functional regions in the DNA 
sequence. In this case, if the sequence S'-TATAAAO 1 on the coding strand of DNA is 
positioned about 30 base pairs upstream of the transcription start site, the sequence forms part 

20 of the promoter region (Lewin, Genes VI, pp. 831-835). On the other hand, if the sequence 5*- 
TATAAA-3' is downstream of the transcription start site in a coding region and in proper 
register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues 
(Lewin, Genes VI, pp. 213-215). 

25 

While not being held to one hypothesis, it is believed that the binding of the polyamides 
of this invention modulate gene expression by altering the binding of DNA binding proteins, 
such as RNA polymerase, transcription factors, TBF, TFIIIB and other proteins. The effect on 
gene expression of polyamide binding to a segment of double stranded DNA is believed to be 
30 related to the function, e.g., promoter, of that segment of DNA. 

It is to be understood by one skilled in the art that the improved polyamides of the 
present invention may bind to any of the above-described DNA sequences or any other 
sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 

5 
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5,578,444 describes numerous promoter targeting sequences from which base pair sequences 
for targeting an improved polyamide of the present invention may be identified. 

It is generally understood by those skilled in the art that the basic structure of DNA in a 
5 living cell includes both major and a minor groove. For the purposes of describing the present 
invention, the minor groove is the narrow groove of DNA as illustrated in common molecular 
biology references such as Lewin, B., Genes VI, Oxford University Press, New York (1997). 

To affect gene expression in a cell, which may include causing an increase or a decrease 
in gene expression, a effective quantity of one or more polyamide is contacted with the cell and 
internalized by the cell. The cell may be contacted in vivo or in vitro. Effective extracellular 
concentrations of polyamides that can modulate gene expression range from about 10 
nanomolar to about 1 micromolar. Gottesfeld, J.M., et al, Nature 387 202-205 (1997). To 
determine effective amounts and concentrations of polyamides in vitro, a suitable number of 
cells is plated on tissue culture plates and various quantities of one or more polyamide are 
added to separate wells. Gene expression following exposure to a polyamide can be monitored 
in the cells or medium by detecting the amount of the protein gene product present as 
determined by various techniques utilizing specific antibodies, including ELISA and western 
blot. Alternatively, gene expression following exposure to a polyamide can be monitored by 
detecting the amount of messenger RNA present as determined by various techniques, including 
northern blot and RT-PCR. 

Similarly, to determine effective amounts and concentrations of polyamides for in vivo 
administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal 
25 fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is 
analyzed. Gene expression following exposure to a polyamide can be monitored by detecting 
the amount of the protein gene product present as determined by various techniques utilizing 
specific antibodies, including ELISA and western blot. Alternatively, gene expression 
following exposure to a polyamide can be monitored by the detecting the amount of messenger 
30 RNA present as determined by various techniques, including northern blot and RT-PCR. 

The polyamides of this invention may be formulated into diagnostic and therapeutic 
compositions for in vivo or in vitro use. Representative methods of formulation may be found 

6 
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in Remington: The Science and Practice of Pharmacy, 19th ed., Mack Publishing Co., Easton, 
PA (1995). 

For in vivo use, the polyamides may be incorporated into a physiologically acceptable 
5 pharmaceutical composition that is administered to a patient in need of treatment or an animal 
for medical or research purposes. The polyamide composition comprises pharmaceutical^ 
acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in 
solid, liquid, gel, or aerosol form. The polyamide composition of the present invention may be 
administered in various dosage forms orally, parentally, by inhalation spray, rectally, or 
10 topically. The term parenteral as used herein includes, subcutaneous, intravenous, 
intramuscular, intrasternal, infusion techniques or intraperitoneally. 

The selection of the precise concentration, composition, and delivery regimen is 
influenced by, inter alia, the specific pharmacological properties of the particular selected 
15 compound, the intended use, the nature and severity of the condition being treated or diagnosed, 
the age, weight, gender, physical condition and mental acuity of the intended recipient as well 
as the route of administration. Such considerations are within the purview of the skilled artisan. 
Thus, the dosage regimen may vary widely, but can be determined routinely using standard 
methods. 

20 

Polyamides of the present invention are also useful for detecting the presence of double 
stranded DNA of a specific sequence for diagnostic or preparative purposes. The sample 
containing the double stranded DNA can be contacted by polyamide linked to a solid substrate, 
thereby isolating DNA comprising a desired sequence. Alternatively, polyamides linked to a 
25 suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be 
contacted by a sample containing double stranded DNA. 

The design of bifunctional sequence specific DNA binding molecules requires the 
integration of two separate entities: recognition and functional activity. Polyamides that 
30 specifically bind with subnanomolar affinity to the minor groove of a predetermined sequence 
of double stranded DNA are linked to a functional molecule, providing the corresponding 
bifunctional conjugates useful in molecular biology, genomic sequencing, and human medicine. 
Polyamides of this invention can be conjugated to a variety of functional molecules, which can 
be independently chosen from but is not limited to arylboronic acids, biotins, polyhistidines 
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comprised from about 2 to 8 amino acids, haptens to which an antibody binds, solid phase 
supports, oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, bromoacetamide, 
iodoacetamide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, texas red, 
anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, psoralen, ethyl red, 
5 4-(psoraen-8-yloxy)-butyrate, tartaric acid, (+)-ct-tocopheral, psoralen, EDTA, methidium, 
acridine, Ni(II)*Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles. Such 
bifiinctional polyamides are useful for DNA affinity capture, covalent DNA modification, 
oxidative DNA cleavage, DNA photocleavage. Such bifiinctional polyamides are useful for 
DNA detection by providing a polyamide linked to a detectable label. Detailed instructions for 
10 synthesis of such bifiinctional polyamides can be found in copending U.S. provisional 
application 60/043,444, the teachings of which are incorporated by reference. 

DNA complexed to a labeled polyamide can then be determined using the appropriate 
detection system as is well known to one skilled in the art. For example, DNA associated with 
15 a polyamide linked to biotin can be detected by a streptavidin / alkaline phosphatase system. 

The present invention also describes a diagnostic system, preferably in kit form, for 
assaying for the presence of the double stranded DNA sequence bound by the polyamide of this 
invention in a body sample, such brain tissue, cell suspensions or tissue sections, or body fluid 
20 samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, and 
preferably the amount, of the double stranded DNA sequence bound by the polyamide in the 
sample according to the diagnostic methods described herein. 

The diagnostic system includes, in an amount sufficient to perform at least one 
25 assay, a specific polyamide as a separately packaged reagent. Instructions for use of the 
packaged reagent(s) are also typically included. As used herein, the term "package" refers 
to a solid matrix or material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of 
the present invention. Thus, for example, a package can be a glass vial used to contain 
30 milligram quantities of a contemplated polyamide or it can be a microliter plate well to which 
microgram quantities of a contemplated polypamide have been operatively affixed, i.e., linked 
so as to be capable of being bound by the target DNA sequence. "Instructions for use" typically 
include a tangible expression describing the reagent concentration or at least one assay method 
parameter such as the relative amounts of reagent and sample to be admixed, maintenance time 
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periods for reagent or sample admixtures, temperature, buffer conditions and the like. A 
diagnostic system of the present invention preferably also includes a detectable label and a 
detecting or indicating means capable of signaling the binding of the contemplated polyamide 
of the present invention to the target DNA sequence. As noted above, numerous detectable 
5 labels, such as biotin, and detecting or indicating means, such as enzyme-linked (direct or 
indirect) streptavidin, are well known in the art. 

Figure 1 shows representative structures of polyamides. ImlmPyPy-y-ImPyPyPy-P-Dp 
(1), ImlmPyPy-Y-ImHpPyPy-P-Dp (2), and ImlmHpPy-y-ImPyPyPy-p-Dp (3). (Hp - 3- 

10 hydroxy-N-methylpyrrole, Im = N-methylimidazole, Py = N-methylpyrrole, (3 = p-alanine, y = 
y-aminobutyric acid, Dp = Dimethylaminopropylamide). Polyamides were synthesized by solid 
phase methods using Boc-protected 3-methoxypyrrole. imidazole, and pyrrole aromatic amino 
acids, cleaved from the support by aminolysis, deprotected with sodium thiophenoxide, and 
purified by reversed phase HPLC. Baird, E. E. & Dervan, P. B. describes the solid phase 

15 synthesis of polyamides containing imidazole and pyrrole amino acids. J. Am. Chem, Soc. 118, 
6141-6146 (1996); also see PCT US 97/003332. The identity and purity of the polyamides 
were verified by f H NMR, analytical HPLC, and matrix-assisted laser-desorption ionization 
time-of- flight mass spectrometry (MALDI-TOF MS-monoisotopic): 1 1223.6 (1223.6 
calculated), 2 1239.6 (1239.6 calculated); 3 1239.6 (1239.6 calculated). 

20 

Figure 2 illustrates binding models for polyamides 1-3 in complex with 5'-TGGTCA-3' 
and 5'-TGGACA-3' (A#T and T«A in fourth position highlighted). Filled and unfilled circles 
represent imidazole and pyrrole rings respectively; circles containing an H represent 3- 
hydroxypyrrole, the curved line connecting the polyamide subunits represents y-aminobutyric 
25 acid, the diamond represents P-alanine, and the -r represents the positively charged 
dimethylaminopropylamide tail group. 

Figure 3 shows quantitative DNase I footprint titration experiments with polyamides 2 
and 3 on the 3' 32 P labeled 250-bp pJK6 EcoRI/Pvull restriction fragment. Lane 1, intact DNA; 
30 lanes 2-1 1 DNase I digestion products in the presence of 100, 50, 20, 10, 5, 2, 1, 0.5, 0.2, 0.1 
nM polyamide, respectively; lane 12, DNase I digestion products in the absence of polyamide; 
lane 13, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. describes an 
adenine-specific DNA chemical sequencing reaction. Methods EnzymoL 15, 7823-7830 (1987). 
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All reactions were done in a total volume of 400 fiL. A polyamide stock solution or H 2 0 was 
added to an assay buffer containing radiolabeled restriction fragment, with the final solution 
conditions of 10 mM Tris-HCl, 10 mM KCl, 10 mM MgCl 2 , 5 mM CaCl 2 > pH 7.0. Solutions 
were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting reactions. 
5 Footprinting reactions, separation of cleavage products, and data analysis were carried out as 
described. White, S., Baird, E. E. & Dervan, P. B. Effects of the A^^/T^A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147- 
6152 (1996). 

10 Figure 4 shows the structure and equilibrium dissociation constant for numerous 

compounds of the present invention. Polyamides are shown in complex with their respective 
match site. Filled and unfilled circles represent imidazole (Im) and pyrrole (Py) rings, 
respectively; circles containing an H represent 3-hydroxypyrrole (Hp), the curved line 
connecting the polyamide subunits represents y-aminobutyric acid (y), the diamond represents 

15 P-alamne (p), and the + represents the positively charged dimethylaminopropylamide tail group 
(Dp). The equilibrium dissociation constants are the average values obtained from three DNase 
I footprint titration experiments. The standard deviation for each set is less than 15% of the 
reported number. Assays were carried out in the presence of 10 mM Tris^HCl, 10 mM KC1, 10 
mM MgCl 2 , and 5 mM CaCl 2 at pH 7.0 and 22°C. 

20 

Figure 5 shows the synthetic scheme for 3-O-methyl-N-Boc protected pyrrole-2- 
carboxylate. The hydroxypyrrole monoester can be prepared in 0.5 kg quantity using published 
procedures on enlarged scale. 

25 Figure 6 shows the solid phase synthetic scheme for ImlmPyPy-y-ImHpPyPy-P-Dp 

starting from commercially available Boc-p-Pam-Resin: (i) 80% TFA/DCM, 0.4 M PhSH; (ii) 
Boc-Py-OBt, DIEA, DMF; (iii) 80% TFA/DCM, 0.4 M PhSH; (iv) Boc-Py-OBt, DIEA, DMF; 
(v) 80% TFA/DCM, 0.4 M PhSH; (vi) Boc-3-OMe-Py-OH, HBTU. DMF, DIEA; (vii) 80% 
TFA/DCM, 0.4 M PhSH; (viii) Boc-Im-OH, DCC, HOBt; (ix) 80% TFA/DCM, 0.4 M PhSH; 

30 (x) Boc-y-aminobutync acid, DIEA, DMF; (xi) 80% TFA/DCM, 0.4 M PhSH; (xii) Boc-Py- 
OBt, DIEA, DMF; (xiii) 80% TFA/DCM, 0.4 M PhSH; (xiv) Boc-Py-OBt, DMF, DIEA; (xv) 
80% TFA/DCM, 0.4 M PhSH; (vxi) Boc-Im-OH, DCC, HOBt (xvii) 80% TFA/DCM, 0.4 M 
PhSH; (xviii) imidazole-2-carboxylic acid, HBTU, DIEA; (xviv) dimethylaminopropylamine, 
55 °C, 18h. Purification by reversed phase HPLC provides ImlmPyPy-y-ImOpPyPy-p-Dp. (Op 

35 = 3-methoxypyrrole). Treatment of the 3-methyoxypyrrole polyamide with thiophenol, NaH, 
DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC purification. 

10 
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Figure 7 shows the aromatic region from 7-11 ppm for the 1H-NMR spectrum 
determined at 300 MHz for ImlmPyPy-y-ImOpPyPy-p-Dp and ImlmPyPy-y-ImHpPyPy-p-Dp. 
This region of the spectrum may be used to determine compound identity and purity. 

Figure 8 shows the MALDI-TOF mass spectrum determined in positive ion mode with a 
monoisotopic detector for the polyamides for IndmPyPy-y-ImOpPyPy-P-Dp and ImlmPyPy-y- 
ImHpPyPy-P-Dp. This spectrum may be used to determine compound identity and purity. 

Figure 9 shows the methyl group region from 3.5-4.0 ppm for the 1H-NMR spectrum 
determined at 300 MHz for ImPyPy-y-OpPyPy-p-Dp and ImPyPy-y-HpPyPy-P-Dp. This region 
of the spectrum may be used to directly follow the progress for conversion of 3-methoxypyrrole 
to 3-hydroxypyrrole. 

Fig. 10 shows quantitative DNase I footprint titration experiments with the polyamides 
ImPyPy-y-PyHpPy-p-Dp and ImHpPy-y-PyPyPy-P-Dp on the 3'- 32 P labeled 3 70-bp pDEHl 
EcoKUPvtdl restriction fragment. Intact lane, labeled restriction fragment no polyamide or 
DNase I added; lanes 1-10, DNase I digestion products in the presence of 10 ^iM, 5 jiM, 2 |iM, 
1 |iM, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM ImPyPy-y-PyPyPy-p-Dp, respectively 
or 1 nM, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM, 5 nM, 2 nM, I nM ImHpPy-y- 
PyPyPy-p-Dp, respectively; DNase I lane, DNase I digestion products in the absence of 
polyamide; A lane, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. 
describes an adenine-specific DNA chemical sequencing reaction. Methods Enzymol. 15, 7823- 
7830 (1987). All reactions were done in a total volume of 40 \xL, A polyamide stock solution 
or H 2 0 was added to an assay buffer containing radiolabeled restriction fragment, with the final 
solution conditions of 10 mM Tris-HCl. 10 mM KC1, 10 mM MgCl 2 , 5 mM CaCl 2 , pH 7.0. 
Solutions were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting 
reactions. Footprinting reactions, separation of cleavage products, and data analysis were 
carried out as described. White, S., Baird, E. E. & Dervan, P. describe the pairing rules for 
recognition in the minor groove of DNA by pyrrole-imidazole polyamides. Chemistry & 
Biology 4, 569-578 (1997). 

Figure 1 1 shows the synthesis of a Afunctional polyamide which incorporates the Hp/Py 
pair. Treatment of a sample of ImlmPyPy-y-ImHpPyPy-P-Pam-resin (see Figure 6) with 3,3'- 
diamino-A^-methyldipropylamine, 55°C, 18 h followed by reverse phase HPLC purification 
provides the Op polyamide with a free primary amine group which can be coupled to an 
activated carboxylic acid derivative. Treatment with (i) EDTA-dianhydride, DMSO/NMP, 
DIEA, 55 °C; (ii) 0.1M NaOH, followed by reverse phase HPLC purification provides the Op- 
Py-Im-polyamide-EDTA conjugate. Treatment of the 3-methyoxypyrrole polyamide with 
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thiophenol, NaH, DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC 
purification. 

Figure 12 shows the determination of the binding orientation of hairpin polyamides 
ImlmPyPy-y-ImHpPyPy-P-Dp-EDTA'Fe(II) 2-E»Fe(II) and ImlmHpPy-y-ImPyPyPy-p-Dp- 
EDTA*Fe(II) 3-E»Fe(II) by affinity cleaving footprint titration. Top and bottom left: Affinity 
cleavage experiments on a 3' 32 P labeled 250-bp pJK6 EcoRl/ Pvu II restriction fragment. The 
5'-TGGACA-3* and 5'-TGGTCA-3' sites are shown on the right side of the autoradiogram. 
Top left: lane 1, adenine-specific chemical sequencing reaction; lanes 2-6, 6.5 \iM, 1.0 nM, 
100 tiM, 10 nM, 1 nM polyamide 2-E»Fe(II); lane 7, intact restriction fragment, no polyamide 
added. Bottom left: lane 1, A reaction; lanes 2-6, 8.5 \iM, 1.0 JliM, 100 nM, 10 nM, 1 nM 
polyamide 3-E»Fe(II); lane 7, intact DNA. All reactions were carried out in a total volume of 
40 nL. A stock solution of polyamide or H 2 0 was added to a solution containing 20 kcpm 
labeled restriction fragment, affording final solution conditions of 25 mM Tris-Acetate, 20 mM 
NaCl, 100 \iW bp calf thymus DNA, at pH 7.0. Solutions were allowed to equilibrate for a 
minimum of 4 h at 22°K before initiation of reactions. Affinity cleavage reactions were carried 
out as described White, S., Batrd, E.E. & Dervan, P.B. Effects of the A»T/T»A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147- 
6152 (1996). Top and bottom right: Affinity cleavage patterns of 2-E*Fe(II) and 3-E*Fe(H) at 
100 nM bound to 5 , -TGGACA-3' and 5'-TGGTCA-3\ Bar heights are proportional to the 
relative cleavage intensities at each base pair. Shaded and nonshaded circles denote imidazole 
and pyrrole carboxamides, respectively. Nonshaded diamonds represent the p-alanine moiety. 
A curved line represents the y-aminobutyric acid, and the + represents the positively charged 
dimethylaminopropylamide tail group. The boxed Fe denotes the EDTA«Fe(II) cleavage 
moiety. 

Figure 13 shows quantitative DNase I footprint titration experiments with the 
polyamides ImPyPyPyPy-y-ImPyPyPyPy-(i-Dp and ImHpPyPyPy-y-ImHpPyPyPy-p-Dp on the 
3' 32 P labeled 252-bp pJK7 EcoRU Pvu II restriction fragment. For ImPyPyPyPy-y- 
ImPyPyPyPy-p-Dp gel (left): lane 1, DNase I digestion products in the absence of polyamide; 
lanes 2-18, DNase I digestion products in the presence of 1.0 ^iM, 500 nM, 200, 100, 65, 40, 25, 
15, 10, 6.5, 4.0, 2.5, 1.5, 1.0, 0.5, 0.2, 0.1 nM polyamide; lane 19, DNase I digestion products in 
the absence of polyamide; lane 20, intact restriction fragment; lane 21, guanine-specific 
chemical sequencing reaction; lane 22, adenine-specific chemical sequencing reaction. For 
ImHpPyPyPy-y-ImHpPyPyPy-P'Dp gel (right): lane 1, intact DNA; lane 2, DNase I digestion 
products in the absence of polyamide; lanes 3-19, 1.0 ]iM, 500 nM, 200, 100, 50, 20, 10, 5, 2, 1, 
0.5, 0.2. 0.1, 0.05, 0.01, 0.005, 0.001 nM polyamide; lane 20, DNase I digestion products in the 
absence of polyamide; lane 21, A reaction. All reactions were done in a total volume of 400 
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\iL. A polyamide stock solution or H 2 0 was added to an assay buffer containing radiolabeled 
restriction fragment, with the final solution conditions of 10 mM Tris-HCl, 10 mM KC1, 10 mM 
MgCl 2 , 5 mM CaCl 2 , pH 7.0. Solutions were allowed to equilibrate for 4-12 h at 22°C before 
initiation of footprinting reactions. Footprinting reactions, separation of cleavage products, and 
data analysis were carried as described. White, S., Baird, E.E. & Dervan, P.B. Effects of the 
A*T/T»A degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA. 
Biochemistry 35, 6147-6152 (1996). 

Fig. 14 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula XiX 2 X 3 X4-Y-X 5 X 6 X7X8 wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X 4 /X 5 , X 3 /X 6 , X 2 /X 7 , and X,/X 8 represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGTNNW-3\ where the 5'-GTNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5 , -WGTNNW-3' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

Fig. 15 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X l X 2 X 3 X 4 -Y-X 5 X 6 X 7 X 8 wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X 4 /X 5 , X 3 /X 6 , X 2 /X 7 , and X^Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGANNW-3', where the 5'-GANN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5*-WGANNW-3* sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 
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Fig. 16 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X 1 X 2 X 3 X4-y-X 5 X 6 X 7 X 8 wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 

5 diaminobutyric acid; X4/X5, X 3 /X 6 , X 2 /X 7 , and X x /X% represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5*- 
WGGNNW-3\ where the 5'-GGNN-3' core sequence is defined as position a, b» c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 

10 minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5*-WGGNNW-3' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

15 

Fig. 17 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X,X 2 X3X4-y-X 5 X 6 X 7 X 8 wherein y is the -NH-CH 2 -CH r CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 

20 diaminobutyric acid; X 4 /X 5) X 3 /X 6 , X 2 /X 7 , and Xj/Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGCNNW-3\ where the 5'-GCNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 

25 minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 S'-WGCNNWO' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

30 

Four-ring polyamide subunits, covalently coupled to form eight-ring hairpin structures, 
bind specifically to 6-bp target sequences at subnanomolar concentrations. Trauger, J.W., 
Baird, E. E. & Dervan, P.B. describe the recognition of DNA by designed ligands at 
subnanomolar concentrations. Nature 382, 559-561 (1996); Swalley, S. E., Baird, E. E. & 
35 Dervan, P. B. describe the discrimination of 5'-GGGG-3\ 5'-GCGC-3\ and 5'-GGCC'3' 
sequences in the minor groove of DNA by eight-ring hairpin polyamides. J. Am. Chem. Soc. 
119, 6953-6961 (1997). The DNA-binding affinities of three eight-ring hairpin polyamides 
shown in Figure 1 as compound 1, 2, and 3 containing pairings of Im/Py, Py/Im opposite G«C, 
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OG and either Py/Py, Hp/Py, or Py/Hp at a common single point opposite T»A and A»T has 
been determined. Equilibrium dissociation constants (K^) for ImlmPyPy-y-ImPyPyPy-P-Dp 1, 
ImlmPyPy-y-ImHpPyPy-P-Dp 2, ImlmHpPy-Y-ImPyPyPy-p-Dp 3 of Figure 1 are shown in 
Table 1. Brenowitz, M., Senear, D. F., Shea, M. A. & Ackers, G. K. describe a quantitative 
DNase footprint titration method for studying protein-DNA interactions. Methods EnzymoL 
130, 132-181 (1986); The values were determined by quantitative DNase I footprint 
titration experiments: on a 3' 32 P-labeled 250-bp DNA fragment containing the target sites, 5'- 
TGGACA-3' and 5'-TGGTCA-3' which differ by a single A»T base pair in the fourth position. 
The DNase footprint gels are shown in Figure 3. 



TABLE 1 Equilibrium dissociation constants* 

Polyamidet 5'-TGGTCA*3' 5'-TGGACA-3' *rel* 



S-T G Gfl 

1 Py/Py +XKXX 
3'-A c cy 

K d =0.07 


elc A-3' 
l]G T-5' 

7nM 


5'-T G Gffl 

+x>ooc 

3 -A C C [I 

AT d =0.15 i 


He A-3' 

3 J) 2.0 
[j O T-5' 

iM 


5 -T G G fl 
3 v 3'-A C C [J 

K d = 15 n 


He A-3' 
i]G T-5* 

M 


5*-T G G [j 

3'-A C C\2 

K d = 0.83 


□ C A-3' 

5^ 0.06 

P|G T-5' 

nM 


5*-T G Gil 
#"#■( 

3 Hp/Py +XKX)( 
3'-A C C[J 

ff d =0.48 


flC A-3' 

52> 

yG T-5' 
nM 


5*-T 6 6(j 

+XKX>C 

3 ' -A C C 

AT d =37 n\ 


SlC A-3' 

77 

cjG T-5' 
1 



*The reported dissociation constants are the average values obtained from three 
DNase I footprint titration experiments. The standard deviation tor each data set is 
less than 15% of the reported number. Assays were carried out in the presence of 10 
mM Tris'HCl, 10 mM KC1, 10 mM MgCU, and 5 mM CaCl 2 at pH 7.0 and 22 °C. 
tRing pairing opposite T»A and A»T in the fourth position. 
tCalculated as K d (5'-TGGACA-3')/ K d (5'-TGGTC A-3'). 

Based on the pairing rules for polyamide-DNA complexes both of these sequences are a 
match for control polyamide 1 which places a Py/Py pairing opposite 

A«T and T*A at both sites. It was determined that in polyamide 1 (Py/Py) binds to 5*- 
TGGICA-3' and 5'-TGGACA-3' within a factor of 2 (K^ = 0.077 or 0.15 nM respectively). In 
contrast, polyamide 2 (Py/Hp) binds to S'-TGGTCA^ and 5'-TGGACA-3' with dissociation 
constants which differ by a factor of 18 (IQ =15 nM and 0.83 nM respectively). By reversing 
the pairing in polyamide 3 (Hp/Py) the dissociation constants differ again in the opposite 
direction by a factor of 77 (K^ = 0.48 nM and 37 nM respectively. Control experiments 
performed on separate DNA fragments; reveal that neither a 5 , -TGGGCA-3' or a 5'-TGGCCA- 
3' site is bound by polyamide 2 or 3 at concentrations < 100 nM, indicating that the Hp/Py and 
Py/Hp ring pairings do not bind opposite G«C or C«G. The A»T vs. T»A discrimination is 
achieved preferably when the two neighboring base pairs are G»C and OG (GTC vs. GAC). 
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The specificity of polyamides 2 and 3 for sites which differ by a single A»T/T«A base 
pair results from small chemical changes. Replacing the Py/Py pair in 1 with a Py/Hp pairing 
as in 2, a single substitution of C3-OH for C3-H, destabilizes interaction with S^TGGTCA-3 1 
by 191 -fold, a free energy difference of 3.1 kcal mol* 1 . Interaction of 2 with 5~-TGGACA-3' is 
5 destabilized only 6-fold relative to 1, a free energy difference of 1.1 kcal mol' 1 . Similarly, 
replacing the Py/Py pair in 1 with Hp/Py as in 3 destabilizes interaction with 5'-TGGACA-3" 
by 252-fold, a free energy difference of 3.2 kcal mol' 1 . Interaction of 3 with 5TGGTCA-3' is 
destabilized only 6- fold relative to 1 , a free energy difference of 1 .0 kcal mol" 1 . 

10 The polyamides of this invention provide for coded targeting of predetermined DNA 

sequences with affinity and specificity comparable to sequence-specific DNA binding proteins. 
Hp, Im, and Py polyamides complete the minor groove recognition code using three aromatic 
amino acids which combine to form four ring pairings (Im/Py, Py/Im, Hp/Py, and Py/Hp) which 
complement the four Watson-Crick base pairs, as shown in TABLE 2. There are a possible 240 

15 four base pair sequences which contain at least 1 A«T or T*A base pair and therefore can 
advantageously use an Hp/Py, or Py/Hp carboxamide binding. Polyamides binding to any of 
these sequences can be designed in accordance with the code of TABLE 2. 

TABLE 2 Pairing code for minor groove recognition'* 
Pair G^C OG T«A A*T 

Im/Py 4- 

Py/Im + 

Hp/Py + 

Py/Hp + 
* favored (+), disfavored (-) 

20 
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For certain G*C rich sequences the affinity of polyamide*DNA complexes may be 
enhanced by substitution of an Im/p pair for Im/Py at G*C and p/Im for Py/Im at OG. At A»T 
and T^A base pairs, either a Py/p, p/Py, and p/p may be used. The alternate aliphatic/aromatic 
amino acid pairing code is described in Table 3. 

5 



TABLE 3 Aliphatic/ Aromatic substitution for ring 
pairings* 



Pair 


Substitution 


Im/Py 


Im/p 


Py/Im 


p/Im 


Hp/Py 


Py/p, p/Py, Hp/p, p/p 


Py/Hp 


Py/p, p/Py, p/Hp, p/p 



U. S. Patent 5,578,444 describes numerous promoter region targeting sequences from 
which base pair sequences for targeting a polyamide can be identified. 

10 

PCT U.S. 97/003332 describes methods for synthesis of polyamides which are suitable 
for preparing polyamides of this invention. The use of P-alanine in place of a pyrrole amino 
acid in the synthetic methods provides aromatic/aliphatic pairing (Im/p, p/Im, Py/p, and p/Py) 
and aliphatic/aliphatic pairing (P/P) substitution. The use of y-aminobutyric acid, or a 
15 substituted y-aminobutyric acid such as (R)-2,4 diaminobutyric acid, provides for preferred 
hairpin turns. The following examples illustrate the synthesis of polyamides of the present 
invention. 
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Example 1: 

PREPARATION OF A PROTECTED Hp MONOMER FOR SOLID PHASE 

SYNTHESIS. 

Distamycin and its analogs have previously been considered targets of traditional 

5 multistep synthetic chemistry. Arcamone, F., Orezzi, P. G., Barbieri, W., Nicolella, V. & Penco, 
S. describe a solution phase synthesis of distamycin Gazz. Chim. Ital. 1967, 97, 1097. The 
repeating amide of distamycin is formed from an aromatic carboxylic acid and an aromatic 
amine. The aromatic acid is often unstable to decarboxylation, and the aromatic amines have 
been found to be air and light sensitive. Lown, J. W. & Krowicki, K. describe a solution phase 

10 synthesis of Distamycin J. Org. Chem. 1985, 50, 3774. The variable coupling yields, long 
reaction times (often >24 h), numerous side products, and reactive intermediates (acid chlorides 
and trichloro ketones) characteristic of the traditional solution phase coupling reactions make 
the synthesis of the aromatic carboxamides problematic. B. Merrifield describes the solid phase 
synthesis of a tetrapeptide J. Am. Chem. Soc. 1963, 85, 2149. In order to implement an efficient 

15 solid phase methodology for the synthesis of the pyrrole- imidazole polyamides, the following 
components were developed: (1) a synthesis which provides large quantities of appropriately 
protected monomer or dimer building blocks in high purity, (2) optimized protocols for forming 
an amide in high yield from a support-bound aromatic amine and an aromatic carboxylic acid, 
(3) methods for monitoring reactions on the solid support, and (4) a stable resin linkage agent 

20 that can be cleaved in high yield upon completion of the synthesis, Baird, E. E. & Dervan, P. B. 
describes the solid phase synthesis of polyamides containing imidazole and pyrrole amino 
acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also see PCT US 97/003332. In order to 
prepare polyamides which contain the 3-hydroxypyrrole monomer, a synthesis has been 
developed which allows the appropriately protected Boc-Op acid monomer to be prepared on 50 

25 g scale. *H NMR and 13 C NMR spectra were recorded on a General Electric-QE 300 NMR 
spectrometer in CD3OD or DMSO-J6> with chemical shifts reported in parts per million relative 
to residual CHD2OD or DMSO-rfs, respectively. IR spectra were recorded on a Perkin-Elmer 
FTIR spectrometer. High-resolution mass spectra were recorded using fast atom bombardment 
(FABMS) techniques at the Mass Spectrometry Laboratory at the University of California, 

30 Riverside. Reactions were executed under an inert argon atmosphere. Reagent grade chemicals 
were used as received unless otherwise noted. Still, W. C, Kahn, M. & Mitra, A. describe flash 
column chromatography J. Org. Chem. 1978, 40, 2923-2925. Flash chromatography was 
carried out using EM science Kieselgel 60 (230-400) mesh. Thin-layer chromatography was 
performed on EM Reagents silica gel plates (0.5 mm thickness). All compounds were 

35 visualized with short-wave ultraviolet light. 
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Table 4 intermediates for preparation of Boc-protected 3-methoxypyrrole 



NAME 



STRUCTURE 



Ethyl 4-carboxy-3-hydroxy-l- 
methylpyrrole-2-carboxylate. 



Ethyl 4-[(Benzyloxycarbonyl)amino]-3- 
hydroxy-l-methylpyrrole-2-carboxylate 



Ethyl 4-[(Benzyloxycarbonyl)amino]-3- 
methoxy-l-methylpyrrole-2-carboxylate 





6 Y ^ 



Ethyl 4-[(tert-Butyloxycarbonyl)amino]-3- 
methoxy-l-methylpyrrole-2-carboxylate 



4-[(tert-Butyloxycarbonyl)amino]-3-methoxy 
-l-methylpyrrole-2-carboxylic add 




Ethvl 4-[(benzyloxycarbonyl)amino]'3-hydroxy-l-methylpy Ethyl-4- 
carboxy-3-hydroxy-l-methylpyrrole-2-carboxylate (60 g, 281.7 mmol) was dissolved in 282 
mL acetonitrile. TEA (28.53 g, 282 mmol) was added, followed by diphenylphosphorylazide 
(77.61 g, 282 mmol). The mixture was refluxed for 5 hours, followed by addition of benzyl 
alcohol (270 ml) and reflux continued overnight. The solution was cooled and volitiles 
removed in vacuo. The residue was absorbed onto silca and chromatagraphed, 4:1 hexanes : 
ethyl acetate, to give a white solid (21.58 g, 24%) *H NMR (DMSO-d6) 5 8.73 (s, 1H), 8.31 (s, 
1H), 7.31 (m, 5H), 6.96 (s, 1H), 5.08 (s, 2H), 4.21 (q, 2H, J = 7.1 Hz), 3.66 (s, 3H), 1.25 (t, 3H, 
J = 7.1 Hz); MS m/e 319.163 (M+H 319.122 calcd. for C16H1SN2O5). 



Ethyl 4-f(tert'butoxycarbonyl)aminoJ-3-methoxy~I~methylpyrrole-2- carboxylate. Ethyl 
4-[(benzyloxycarbonyl)amino]-3-hydroxy-l-methylpyTrole-2-carboxylate (13.4 g, 42.3 mmol) 
was dissolved in 110 mL acetone. Anhydrous K2CO3 (11.67 g, 84.5 mmol) was added, 
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followed by methyliodide (5.96 g, 42.3 mmol) and dimethylaminopyridine (0.5 g, 4,23 mmol) 
and the mixture stirred overnight. The solid K2CO3 was removed by filtration and 200 ml 
water added. Volitiles were removed in vacuo and the solution made acidic with addition of IN 
H2SO4 . The aqueous layer was extracted with diethyl ether. Organic layers were combined, 

5 washed with 10% H2SO4, dried over MgS04, and dried to give a white solid. The solid was 
used without further purification and dissolved in 38 ml DMF. DIEA (11 ml), Boc anhydride 
(9.23 g, 42.3 mmol), and 10 % Pd/C (500 mg) were added and the solution stirred under 
hydrogen (1 atm) for 2.1 h. The slurry was filtered through celite which was washed with 
methanol. Water 250 ml was added and volitiles removed in vacuo. The aqueous layer was 

10 extracted with ether. Organic layers were combined, washed with water and brine, and dried 
over MgS04. Solvent was removed in vacuo to give a white solid ( 8.94 g, 71%) *H NMR 
(DMSO-d6) 5 8.43 (s, 1H), 7.03 (s, 1H), 4.19 (q, 2H, J = 7.1 Hz), 3.70 (s, 3H), 3.67 (s, 3H), 
1.42 (s, 9H), 1.26 (t, 3H, J = 7.1); MS m/e 299.161 (M+H 299.153 calcd. for C14H22N2O5). 

15 Ethyl 4-[(benzyloxycarbonyl)amino]-3-hydroxy-l-methylpy Ethyl 4- 

[(/er/-butoxycarbonyl)amino]-3-methoxy- 1 -methylpyrrole-2-carboxylate (9.0 g, 30.2 mmol) 
was dissolved in 30 mL ethanol. NaOH (30 ml, 1 M, aq) was added and the solution stirred for 
4 days. Water (200 ml) was added and ethanol removed in vacuo. The solution was extracted 
with diethyl ether, aqueous layer acidified to pH = 2-3, and extracted again with diethyl ether. 

20 Organic layers were dried over MgS04, and solvent removed in vacuo to give a white solid (6.0 
g, 20.5 mmol, 87% based on recovered SM) ] H NMR (DMSO-d6) 5 12.14 (s, 1H), 8.37 (s, 
1H), 6.98 (s, 1H), 3.69 (s, 3H), 3.66 (s, 3H), 1.42 (s, 9H); MS m/e 293.112 (M+H 293.104 
calcd. for C12H18N2O5). 

25 EXAMPLE 2: 

SOLID PHASE SYNTHESIS OF 3-HYDROXYPYRROLE POLY AMIDES. 

Cycling protocols were optimized to afford high stepwise coupling yields (>99%). 
Deprotection by aminolysis affords up to 100 mg quantities of polyamide. Solid phase 

30 polyamide synthesis protocols were modified from the in situ neutralization Boc-chemistry 
protocols. Schnolzer, M., Alewood, P., Jones, A., Alewood, D., Kent, S.B.H. report rapid in situ 
neutralization for solid phase peptide synthesis Int. J. Peptide. Protein. Res. 1992, 40, 180. 
Coupling cycles are rapid, 72 min per residue for manual synthesis or 180 min per residue for 
machine-assisted synthesis, and require no special precautions beyond those used for ordinary 

35 solid phase peptide synthesis. Manual solid phase synthesis of a pyrrole-imidazole polyamide 
consists of a dichloromethane (DCM) wash, removal of the Boc group with trifluoroacetic acid 
(TFA)/DCM/thiophenol (PhSH), a DCM wash, a DMF wash, taking a resin sample for analysis, 
addition of activated monomer, addition of DIEA if necessary, coupling for 45 min, taking a 
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resin sample for analysis, and a final DMF wash (Figure 5, Table I). In addition, the manual 
solid phase protocol for synthesis of pyrrole-imidazole polyamides has been adapted for use on 
a ABI 430A peptide synthesizer. The aromatic amine of the pyrrole and imidazole do not react 
in the quantitative ninhydrin test. Stepwise cleavage of a sample of resin and analysis by HPLC 
indicates that high stepwise yields (> 99%) are routinely achieved. 

Dicyclohexylcarbodiimide (DCC), Hydroxybenzotriazole (HOBt), 2-(lH-Benzotriazole- 
l-yl)-l,l,3,3-tetramethyluronium hexa-fluorophosphate (HBTU) and 0.2 mmol/gram Boc-(5- 
alanine-(-4-carboxamidomethyl)-benzyl-ester-copoly(styrene-divinylbenzene) resin (Boc-P- 
Pam-Resin) was purchased from Peptides International (0.2 mmol/gram), NovaBiochem (0.6 
mmol/gram), or Peninsula (0.6 mmol/gram). ( (/?)-2-Fmoc-4-Boc-diaminobutyric acid, (S)-2- 
Fmoc-4-Boc-diaminobutyric acid, and (#)-2-amino-4-Boc-diaminobutyric acid were purchased 
from Bachem. A^-diisopropylethylamine (DIEA), AW-dimethylformamide (DMF), N- 
methylpyrrolidone (NMP), DMSO/NMP, Acetic anhydride (Ac 2 0), and 0.0002 M potassium 
cyanide/pyridine were purchased from Applied Biosystems. Dichloromethane (DCM) and 
triethylamine (TEA) were reagent grade from EM, thiophenol (PhSH), 
dimethylaminopropylamine (Dp), Sodium Hydride, (i?)-a-methoxy-a- 
(trifuoromethyl)phenylacetic acid ((/?)MPTA) and (S)-a-methoxy-a- 

(trifouromethyl)phenylacetic acid ((5)MPTA) were from Aldrich, trifluoroacetic acid (TFA) 
Biograde from Halocarbon, phenol from Fisher, and ninhydrin from Pierce. All reagents were 
used without further purification. 

Quik-Sep polypropylene disposable filters were purchased from Isolab Inc. NMR 
spectra were recorded on a General Electric-QE NMR spectrometer at 300 MHz with chemical 
shifts reported in parts per million relative to residual solvent. UV spectra were measured in 
water on a Hewlett-Packard Model 8452A diode array spectrophotometer. Optical rotations 
were recorded on a JASCO Dip 1000 Digital Polarimeter. Matrix-assisted, laser 
desorption/ionization time of flight mass spectrometry (MALDI-TOF) was performed at the 
Protein and Peptide Microanalytical Facility at the California Institute of Technology. HPLC 
analysis was performed on either a HP 1090M analytical HPLC or a Beckman Gold system 
using a RAINEN C l8 , Microsorb MV, 5|im, 300 x 4.6 mm reversed phase column in 0.1% 
(wt/v) TFA with acetonitrile as eluent and a flow rate of 1.0 mL/min, gradient elution 1.25% 
acetonitrile/min. Preparatory reverse phase HPLC was performed on a Beckman HPLC with a 
Waters DeltaPak 25 x 100 mm, 100 \im C18 column equipped with a guard, 0.1% (wt/v) TFA, 
0.25% acetonitrile/min. 18MQ water was obtained from a Millipore MilliQ water purification 
system, and all buffers were 0.2 jim filtered. 
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Activation of Boc-3-methoxypyrrole acid. The amino acid (0.5 mmol) was dissolved in 2 mL 
DMF. HBTU (190 mg, 0.5 mmol) was added followed by DIEA (1 mL) and the resulting 
mixture was shaken for 5 min. 

5 Activation of Imidazole-2-carboxylic acid, y-aminobutyric acid, Boc-glycine, and Boc-P- 
alanine. The appropriate amino acid or acid (2 mmol) was dissolved in 2 mL DMF. HBTU 
(720 mg, 1.9 mmol) was added followed by DIEA (1 mL) and the solution shaken for at least 5 
min. 

10 Activation of Boc-Imidazole acid. Boc imidazole acid (257 mg, 1 mmol) and HOBt (135 mg, 
1 mmol) were dissolved in 2 mL DMF, DCC (202 mg, 1 mmol) is then added and the solution 
allowed to stand for at least 5 min. 

Acetylation Mix. 2 mL DMF, DIEA (710 fiL, 4.0 mmol), and acetic anhydride (380 jiL, 4.0 
15 mmol) were combined immediately before use. 

Manual Synthesis Protocol. Boc-B-alanine-Pam-Resin (1.25 g, 0.25 mmol) is placed in a 20 
mL glass reaction vessel, shaken in DMF for 5 min and the reaction vessel drained. The resin 
was washed with DCM (2 x 30 s.) and the Boc group removed with 80% TFA/DCM/0.5M 

20 PhSH, 1 x 30s., 1 x 20 min The resin was washed with DCM (2 x 30 s.) followed by DMF (1 x 
30 s.) A resin sample (5-10 mg) was taken for analysis. The vessel was drained completely and 
activated monomer added, followed by DIEA if necessary. The reaction vessel was shaken 
vigorously to make a slurry. The coupling was allowed to proceed for 90 min, and a resin 
sample taken. Acetic anhydride (1 mL) was added and the reaction shaken for 5 min. The 

25 reaction vessel was then washed with DMF, followed by DCM. 

Machine-Assisted Protocols. Machine-assisted synthesis was performed on a ABI 430A 
synthesizer on a 0.18 mmol scale (900 mg resin; 0.2 mmol/gram). Each cycle of amino acid 
addition involved: deprotection with approximately 80% TFA/DCM/0.4M PhSH for 3 minutes, 

30 draining the reaction vessel, and then deprotection for 17 minutes; 2 dichloromethane flow 
washes; an NMP flow wash; draining the reaction vessel; coupling for 1 hour with in situ 
neutralization, addition of dimethyl sulfoxide (DMSO)/NMP, coupling for 30 minutes, addition 
of DIEA, coupling for 30 minutes; draining the reaction vessel; washing with DCM, taking a 
resin sample for evaluation of the progress of the synthesis by HPLC analysis; capping with 

35 acetic anhydride/DIEA in DCM for 6 minutes; and washing with DCM. A double couple cycle 
is employed when coupling aliphatic amino acids to imidazole, all other couplings are 
performed with single couple cycles. 
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The ABI 43 OA synthesizer was left in the standard hardware configuration for NMP- 
HOBt protocols. Reagent positions 1 and 7 were DIEA, reagent position 2 was TFA/0.5M 
thiophenol, reagent position 3 was 70% ethanolamine/methanol, reagent position 4 was acetic 
anhydride, reagent position 5 was DMSO/NMP, reagent position 6 was methanol, and reagent 
position 8 was DMF. New activator functions were written, one for direct transfer of the 
cartridge contents to the concentrator (switch list 21, 25, 26, 35, 37, 44), and a second for 
transfer of reagent position 8 directly to the cartridge (switch list 37, 39, 45, 46). 

Boc-Py-OBt ester (357 mg, I mmol) was dissolved in 2 ml DMF and filtered into a 
synthesis cartridge. Boc-Im acid monomer was activated (DCC/HOBt), filtered, and placed in a 
synthesis cartridge. Imidazole-2-carboxylic acid was added manually. At the initiation of the 
coupling cycle the synthesis was interrupted, the reaction vessel vented and the activated 
monomer added directly to the reaction vessel through the resin sampling loop via syringe. 
When manual addition was necessary an empty synthesis cartridge was used. Aliphatic amino 
acids (2 mmol) and HBTU (1.9 mmol) were placed in a synthesis cartridge. 3 ml of DMF was 
added using a calibrated delivery loop from reagent bottle 8, followed by calibrated delivery of 
1 ml DIEA from reagent bottle 7, and a 3 minute mixing of the cartridge. 

The activator cycle was written to transfer activated monomer directly from the cartridge to 
the concentrator vessel, bypassing the activator vessel. After transfer, 1 ml of DIEA was 
measured into the cartridge using a calibrated delivery loop, and the DIEA solution combined 
with the activated monomer solution in the concentrator vessel. The activated ester in 2:1 
DMF/DIEA was then transferred to the reaction vessel. All lines were emptied with argon 
before and after solution transfers. 

ImImOpPy-y-ImPyPyPy-$-Dp. ImlmOpPy-y-ImPyPyPy-P-Pam-Resin was synthesized 
in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 
mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of polyamides 
containing imidazole and pyrrole amino acids. J. Am, Chem. Soc. 118, 6141-6146 (1996); also 
see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by 
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge. 
Upon automated delivery of DMF (2 mL) and DIEA (I mL) activation occurs. A sample of 
ImlmOpPy-y-IniPyPyPy-P-Pam-Resin (400 mg, 0.40 mmol/gram) was placed in a glass 20 mL 
peptide synthesis vessel and treated with neat dimethylaminopropylamine (2 mL) and heated 
(55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin, 
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC. 
ImlmOpPy-y-ImPyPyPy-P-Dp is recovered upon lyophilization of the appropriate fractions as a 
white powder (97 mg, 49% recovery). UV (H 2 0) ?w 246, 316 (66,000); l H NMR (DMSO-4,) 
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5 10.24 (s, 1 H), 10.14 (s, 1 H), 9.99 (s, 1 H), 9.94 (s, 1 H), 9.88 (s, 1 H), 9.4 (br s, 1 H), 9.25 (s, 
I H), 9.11 (s, 1 H), 8.05 (m, 3 H), 7.60 (s, 1 H), 7.46 (s, 1 H), 7.41 (s, 1 H), 7.23 (d, 1), 7.21 (d, 
1 H), 7.19 (d, 1 H), 7.13 (m, 2 H), 7.11 (m, 2 H), 7.02 (d, 1 H), 6.83 (m, 2 H) t 3.96 (s, 6 H), 
3,90 (s, 3 H), 3.81 (m, 6 H), 3.79 (s, 3 H), 3.75 (d, 9 H), 3.33 (q, 2 H, J = 5.4 Hz), 3.15 (q, 2 H, 
5 y * 5.5 Hz), 3.08 (q, 2 H, J« 6.0 Hz), 2.96 (quintet, 2 H, J = 5.6 Hz), 2.70 (d, 6 H, J= 4.5 Hz), 
2.32 (m, 4 H), 1.71 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1253.5 (1253.6 calc. for 
C 5 8H 72 N 22 O n ). 

ImlmHpPy-y-ImPyPyPy. In order to remove the methoxy protecting group, a sample of 

10 ImlmOpPy-y-ImPyPyPy-p-Dp (5 mg, 3.9 ^imol) was treated with sodium thiophenoxide at 100 
°C for 2 h. DMF (1000 \iL) and thiophenol (500 ^L) were placed in a (13 x 100 mm) disposable 
Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in mineral oil (100 mg) was 
slowly added. Upon completion of the addition of the sodium hydride, ImlmOpPy-y-ImPyPyPy- 
P-Dp (5 mg) dissolved in DMF (500 ^L) was added. The solution was agitated, and placed in a 

15 100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube 
was cooled to 0°C, and 7 ml of a 20 % (wt/v) solution of trifluoroacetic acid added. The 
aqueous layer is separated from the resulting biphasic solution and purified by reversed phase 
HPLC. ImlmHpPy-y-ImPyPyPy-p-Dp is recovered as a white powder upon lyophilization of 
the appropriate fractions (3.8 mg, 77 % recovery). UV (H 2 0) X^ x 246, 312 (66,000); 3 H NMR 

20 (DMSO-d,) 5 10.34 (s, 1 H), 10.24 (s, 1 H), 10.00 (s, 2 H), 9.93 (s, 1 H), 9.87 (s, 1 H), 9.83 (s, 
1 H), 9.4 (br s, 1 H), 9,04 (s, 1 H), 8.03 (m, 3 H), 7.58 (s, 1 H), 7.44 (s, 1 H), 7.42 (s, 1 H), 7.23 
(s, 1 H), 7.20 (m, 3 H), 7.12 (m, 2 H), 7.05 (d, 1 H), 7.02 (d, 1 H), 6,83 (s, 1 H), 6.79 (s, 1 H), 
3.96 (s, 6 H), 3.90 (s, 3 H), 3.81 (s, 6 H), 3.79 (s, 3 H), 3.75 (d, 6 H), 3.33 (q, 2 H, J= 5.4 Hz), 
3.14 (q, 2 H, J= 5.4 Hz), 3.08 (q, 2 H,7= 6.L Hz), 2.99 (quintet, 2 H,/= 5.4 Hz), 2.69 (d, 6 H, 

25 J = 4.2 Hz), 2.31 (m, 4 H), 1.72 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 
calc. for C 57 H 7l N 22 0n). 

ImImPyPy-y-ImOpPyPy-$-Dp. InilmPyPy-y-ImOpPyPy-p-Pam-Resin was synthesized 
in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 

30 mmol/g) as described for ImlmOpPy-y-ImPyPyPy-P-Dp. A sample of ImlmPyPy-y-ImOpPyPy- 
P-Pam-Resin (400 mg, 0.40 mmol/gram) was placed in a glass 20 mL peptide synthesis vessel 
and treated with neat dimethylaminopropylamine (2 mL) and heated (55 °C) with periodic 
agitation for 16 h. The reaction mixture was then filtered to remove resin, 0.1% (wt/v) TFA 
added (6 mL) and the resulting solution purified by reversed phase HPLC. ImlmPyPy-y- 

35 ImOpPyPy-P-Dp is recovered upon lyophilization of the appropriate fractions as a white 
powder (101 mg, 50% recovery). UV (H 2 0) 246, 316 (66,000); MALDI-TOF-MS 

(monoisotopic), 1253.6(1253.6 calc. for C 58 H 72 N 22 O n ). 
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ImlmPyPy-y-ImHpPyPy. A sample of ImlmPyPy-y-ImOpPyPy-p-Dp (5 mg, 3.9 \xmo\) 
was treated with sodium thiophenoxide and purified by reversed phase HPLC as described for 
ImlmHpPy-y-ImPyPyPy-p-Dp. ImlmPyPy-y-ImHpPyPy-P-Dp is recovered upon lyophilization 
of the appropriate fractions as a white powder (3.2 mg, 66 % recovery). UV (H 2 0) 246, 
5 312 (66,000); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 calc. for C 5 7H 7l N 22 O u ). 

ImPyPy-y-OpPyPy~$~Dp. ImPyPy-y-OpPyPy-P-Pam-Resin was synthesized in a 
stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 
mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of polyamides 

10 containing imidazole and pyrrole amino acids. J. Am, Chem. Soc. 118, 6141-6146 (1996); also 
see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by 
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge. 
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of 
ImPyPy-y-OpPyPy-P-Pam-Resin (400 mg, 0.45 mmol/ gram) was placed in a glass 20 mL 

15 peptide synthesis vessel and treated with neat dimethylaminopropylarnine (2 mL) and heated 
(55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin, 
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC. 
ImPyPy-y-OpPyPy-P-Dp is recovered upon lyophilization of the appropriate fractions as a 
white powder (45 mg, 25% recovery). UV (H 2 0) A^ ax 246, 310 (50,000); ! H NMR (DMSO-J*) 

20 5 1 0.45 (s, 1 H), 9.90 (s, 1 H), 9.82 (s, 1 H), 9.5 (br s, 1 H), 9.38 (s, 1 H), 9.04 (s, 1 H), 8.02 (m, 
3 H), 7.37 (s, 1 H), 7.25 (m, 2 H), 7.15 (d, 1 H, J = 1.6 Hz), 7.1 1 (m, 2 H), 7.09 (d, 1 H), 7.03 
(d, 1 H), 6.99 (d, 1 H), 6.87 (d, 1 H), 6.84 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 3.77 (s, 6 H), 
3.76 (s, 3 H), 3.74 (s, 1 H), 3.34 (q, 2 H, J = 5.6 Hz), 3.20 (q, 2H,J= 5.8 Hz), 3.09 (q, 2 H, 7- 
6.1 Hz), 2.97 (quintet, 2 H, J = 5.3 Hz), 2.70 (d, 6 H,y- 3.9 Hz), 2.34 (m, 4 H), 1.73 (m, 4 H); 

25 MALDI-TOF-MS (monoisotopic), 1007.6 (1007.5 calc. for C 48 H 6? N !6 0 9 ). 

ImPyPy-y-HpPyPy. In order to remove the methoxy protecting group, a sample of 
ImPyPy-y-OpPyPy-P-Dp (5 mg, 4.8 ^mol) was treated with sodium thiophenoxide at 100 °C 
for 2 h. DMF (1000 |iL) and thiophenol (500 |iL) were placed in a (13 x 100 mm) disposable 

30 Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in mineral oil (100 mg) was 
slowly added. Upon completion of the addition of the sodium hydride, ImlmPyPy-y-ImOpPyPy- 
p-Dp (5 mg) dissolved in DMF (500 nL) was added. The solution was agitated, and placed in a 
100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube 
was cooled to 0°C, and 7 ml of a 20 % (wt/v) solution of trifluoroacetic acid added. The 

35 aqueous layer is separated from the resulting biphasic solution and purified by reversed phase 
HPLC. ImlmHpPy-y-ImHpPyPy-P-Dp is recovered as a white powder upon lyophilization of 
the appropriate fractions (2.5 mg, 52 % recovery). UV (H 2 0) 246, 310 (50,000); l H NMR 
(DMSO-tf*) 5 10.44 (s, 1 H), 10.16 (s, 1 H\ 9.90 (s, 1 H), 9.77 (s, 1 H), 9.5 (br s, 1 H), 9.00 (s, 
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1 H), 8.03 (m, 3 H), 7.37 (s, 1 H), 7.26 (m, 2 H), 7.14 (d, 1 H, J = 1.7 Hz), 7.12 (m, 2 H), 7.02 
(d, 1 H), 6.93 (d, 1 H), 6.88 (d, I H), 6.82 (d, 1 H), 6.72 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 
3.77 (s, 3 H), 3.76 (s, 3 H), 3.74 (s, I H), 3.36 (q, 2H,7= 5.4 Hz), 3.22 (q, 2H,J = 5.9 Hz), 
3.09 (q, 2 H, J = 5.5 Hz), 2.98 (quintet, 2 H, J = 5.3 Hz), 2.70 (d, 6 H, J = 4.3 Hz), 2.34 (m, 4 
5 H), 1.78 (m, 4 H); MALDI-TOF-MS (monoisotopic), 994.2 (993.5 calc. for C 47 H 61 N 16 0 9 ). 



Table 5. Mass spectral characterization of Op and Hp polyamides, synthesized and purified as 
described for ImlrnOpPy-y-ImPyPyPy-P-Dp and ImImHpPy-y-ImPyPyPy-3-Dp. 





POLY AMIDE 


FORMULA (M+H)CALCD 


FOUND 


10 


ImOpPy-y-PyPyPy-P-Dp 


C 48 H 63 N l6 0 9 


10O7.5 


1007.5 




ImHoPv-v-PvPvPv-B-Dn 


C 4 7H 61 N 16 0 9 


993.5 


993.2 




ImPvOo-v-PvPvPv-B-Do 


C48H63N16O9 


1007.5 


1007.5 




ImPvHo-Y-PvPvPv-B-DD 


C 4 7H 61 N 16 0 9 


993.5 


993.4 




IitiPvPv-y-OoPvPv-B-Dd 


C 48 H 63 Ni 6 0 9 


1007.5 


1007.6 


15 


ImPvPv-v-HDPvPv-B-Dn 

11114. jr i _y jf iil^i jr x jf ^/ 1— ' Ly 


C47H6|Ni 6 0 9 


993.5 


993.2 




ImPvPv-Y-PvOoPv-B-Do 


C 48 H 63 Ni 6 0 9 


1007.5 


1007.5 




TmPvPv-Y-PvHoPv-B-Dn 


C 4 7H6iNi60 9 


993.5 


993.4 




ImOoOo-Y-PvPvPv-B-Do 




1037.5 


1037.5 




i iTixiprip-y-i y r yr y- p -up 


u xt r\ 
^4 7 H 6 |M 16 (J|o 






20 


ImlmOpPy-y-ImPyPyPy-p-Dp 


C 58 H 72 N 22 O n 


1253.6 


1253.5 




ImlmHpPy-y-ImPyPyPy-P-Dp 


C S 7H 71 N 22 0 1I 


1239.6 


1239.6 




ImlmPyPy-y-ImOpPyPy-P-Dp 


C 58 H 72 N 22 Ou 


1253.6 


1253.6 




ImlmPyPy-y-ImHpPyPy-P-Dp 


C 5 7H 71 N 22 0n 


1239.6 


1239.6 




ImOpPy Py-y- ImOpPyPy-P-Dp 


C 6 oH 76 N 21 0 12 


1282.6 


1282.6 


25 


ImHpPyPy-y-ImHpPyPy-P-Dp 


C 58 H 72 N 2I 0 I2 


1254.6 


1254.6 




ImlmOpPy-y-ImOpPyPy-P-Dp 


C 59 H 75 N 22 0 12 


1283.6 


1283.6 




ImlmHpPy-y-ImHpPyPy-p-Dp 


C 5 7H 71 N 22 0 !2 


1255.6 


1255.5 




ImOpPyPy-y-PyPyPyPy- P-Dp 


C6oH75N 2 oOn 


1251.6 


1251.5 




ImPyPyPy-y-PyPyOpPy-P-Dp 


C 6 oH7 5 N 2 oOii 


1251.6 


1251.5 


30 


ImlmPyPy-y-ImPyOpPy-P-Dp 


C 58 H 72 N 22 On 


1253.6 


1253.7 




ImOpPyPyPy-y-ImOpPyPyPy-P-Dp 


C7 2 HggN 2 50i4 


1526.7 


1526.6 




ImHpPyPyPy-y-ImHpPyPyPy-P-Dp 


C7oH 8 4N 25 Oi4 


1498.7 


1498.0 




ImlmPyPyPy-y-ImOpOpPyPy-P-Dp 


C7,Hg7N 26 0 14 


1527.7 


1527.7 
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EXAMPLE 3: 
DETERMINATION OF POLY AMIDE 
BINDING AFFINITY AND SEQUENCE SPECIFICITY. 

5 Representative footprint titration experiments are shown in Figures 3 and 10, A 252-bp 

DNA fragment which is typically used for the footprint titration experiments provides 247 
possible 6-bp binding sites for an eight-ring hairpin polyamide. Thus, in addition to providing 
DNA binding affinities, the footprint titration experiments also reveal DNA binding sequence- 
specificity. The DNA binding sequence specificity of polyamides which differ by a single 

10 Py/Py, Hp/Py, or Py/Hp pair for sites which differ by a single A # T or T»A base pair are 
described in Tables 1, 6, and 7. 

Quantitative DNase I Footprint Titrations All reactions were executed in a total volume 
of 400 |j.L (Brenowitz, M. et al., 1986). A polyamide stock solution or H 2 0 (for reference 

15 lanes) was added to an assay buffer containing 3'- 32 P radiolabeled restriction fragment (20,000 
cpm), affording final solution conditions of 10 mM Tris^HCl, 10 mM KC1, 10 mM MgCl 2 , 5 
mM CaCl 2 , pH 7.0, and either (i) a suitable concentration range of polyamide, or (ii) no 
polyamide (for reference lanes). The solutions were allowed to equilibrate for 24 hours at 22°C. 
Footprinting reactions were initiated by the addition of 10 \xL of a stock solution of DNase I (at 

20 the appropriate concentration to give -55% intact DNA) containing 1 mM dithiothreitol and 
allowed to proceed for 7 minutes at 22°C. The reactions were stopped by the addition of 50 |iL 
of a solution containing 2.25 M NaCl, 150 mM EDTA, 23 |aM base pair calf thymus DNA, and 
0.6 mg/ml glycogen, and ethanol precipitated. The reactions were resuspended in 1 x TBE/ 
80% formamide loading buffer, denatured by heating at 85°C for 15 minutes, and placed on ice. 

25 The reaction products were separated by electrophoresis on an 8% polyacrylamide gel (5% 
crosslinking, 7 M urea) in 1 x TBE at 2000 V for 1.5 h. Gels were dried on a slab dryer and 
then exposed to a storage phosphor screen at 22°C. 

Photostimuable storage phosphor imaging plates (Kodak Storage Phosphor Screen 
30 SO230 obtained from Molecular Dynamics) were pressed flat against dried gel samples and 
exposed in the dark at 22°C for 12-24 hours. A Molecular Dynamics 400S Phosphorlmager 
was used to obtain all data from the storage screens (Johnston et al., 1990). The data were 
analyzed by performing volume integration of the target site and reference blocks using the 
ImageQuant v. 3.3 software running on a Compaq Pentium 80. 

35 

Quantitative DNase I Footprint Titration Data Analysis was performed by taking a 
background-corrected volume integration of rectangles encompassing the footprint sites and a 
reference site at which DNase I reactivity was invariant across the titration generated values for 
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the site intensities (I S11C ) and the reference intensity (I ref ). The apparent fractional occupancy 



(9a P p) of the sites were calculated using the equation: 

G«pp - 1 - — — — 



(1) 



where I sile ° and I ref ° are the site and reference intensities, respectively, from a DNase I control 
lane to which no polyamide was added. 



10 



15 



20 



Th e ([L]tot> 9ap P ) data were fit to a Langmuir binding isotherm (eq. 2, n=l) by 
minimizing the difference between 6 app and G m , using the modified Hill equation: 



+ (Gn 



8 trun ) 



K. B fLp 



(2) 



1 + Ka n [L] P tot 

where [L tol ] is the total polyamide concentration, K a is the equilibrium association constant, and 
8 min and 9 max are the experimentally determined site saturation values when the site is 
unoccupied or saturated, respectively. The data were fit using a nonlinear least-squares fitting 
procedure of KaleidaGraph software (v. 3.0. 1, Abelbeck Software) with K a , 9^, and G min as the 
adjustable parameters. The goodness of fit of the binding curve to the data points is evaluated 
by the correlation coefficient, with R > 0.97 as the criterion for an acceptable fit. Four sets of 
acceptable data were used in determining each association constant. All lanes from a gel were 
used unless a visual inspection revealed a data point to be obviously flawed relative to 
neighboring points. The data were normalized using the following equation: 

6 eop - Gmin 

0 norm = ————— (3) 
9 max - Gmin 



TABLE 6 Discrimination of 5'-TGTAA-3' and 5'-TGTTA-3'* 



Pair' 5'-TGTAA-3' 



5'-TGTTA-3' 



5'-T G T A A-3' 5 ' -T G T T A-3' 
y y 3'-A CAT T-5" 3'-A C A A T-5* 



K d = 0.014 \lM 



K d = 0.007 nM 



Py/Hp 



3" -A C A 



5 4 -T G T A A-3* 5*-T G T T A-3' 

•OOn #OCX 



T-5' 



3' -A C A 



T-5' 



0.36 



K 6 = 0.20 uM 



K d = 0.56 uM 



5'-T G T[a]a-3' 5*-T G T[*1a-3' 

•OCSK #o®^ 

Hp/Py +»OO0 ; ^KKX)0 ; 14 

3'-A C aItJT-5' 3*-A C aIaJT-S' 
* d = 4.0 uM * d = 0.28 

"The reported equilibrium dissociation constants are the mean values 
obtained from two DNase I footprint titration experiments on a 3' ^P 
labeled 370-bp pDEHl EcoRl/Pvull DNA restriction fragment 13 . The 
assays were carried out at 22 °C, pH 7.0 in the presence of 10 mM 
Tris^HCl, 10 mM KC1. 10 mM MgClj, and 5 mM CaCi*. 
tRing pairing opposite T\A and A-T in the third position. 
^Calculated as K ,(5'-TGTAA-3'W K .(S'-TCTrA-^. 
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TABLE 7 Discrimination of 5'-TGTTT-3' and 5-TGATT-3' 



Pairt 



5'-TGATT-3' 



5'-TGTTT-3' 



Py/Py 



5*-t ^jgA T ~ 3 ' 

+><>opo^ 

3'-A c It] A A-5' 
K d = 0.026 (iM 



5'-T G 




5.2 



5'-T G ft X T-3* 

tB<x 

Hp/Py +XK>CCX 

3'-A C ISJA A-5* 
AT d = 0.53fiM 



5'-T G T T T-3* 

+>ooccy 

3'-A C [A] A A-5' 
K d = 0.008 



66 



Py/Hp 




T T-3' 



A A-5' 




T T-3' 



A A-5' 



K d = 0.33 ^M 



K 6 = 0.59 



0.56 



The reported equilibrium dissociation constants are the mean values 
obtained from two DNase I footprint titration experiments. The assays 
were carried out at 22 °C, pH 7.0 in the presence of 10 mM Tris»HCI, 
10 mM KC1, 10 mM MgCl 2 , and 5 mM CaCK 
tRing pairing opposite T*A and A»T in the third position. 
^Calculated as K d (5'-TGATT -3')/K d (5'-TCTTT-3'). 



EXAMPLE 5: 

5 PREPARATION OF A BIFUNCTIONAL Hp-Py-Im-POLY AMIDE. 

ImImOpPy^-lmPyPyPy-$-Dp-NH2- ImlmOpPy-y-ImPyPyPy-p-Pam-Resin was 
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam- 
Resin (0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of 

10 polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was 
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine 
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation 
occurs. A sample of ImlmOpPy-y-ImPyPyPy-p-Pam-Resin (400 mg, 0.40 mmol/ gram) was 

15 placed in a glass 20 mL peptide synthesis vessel and treated with neat S^'-diamino-N- 
methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction 
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting 
solution purified by reversed phase HPLC. ImImOpPy-y-ImPyPyPy-p-Dp-NH2 is recovered 
upon lyophilization of the appropriate fractions as a white powder (93 mg, 46% recovery). UV 

20 (H2O) X max 246, 316 (66,000); *H NMR (DMSO-rftf) 5 10.34 (s, 1 H), 10.30 (br s, 1 H), 10.25 
(s, 1 H), 9.96 (s, 1 H), 9.95 (s, 1 H), 9.89 (s, 1 H), 9.24 (s, 1 H), 9.11 (s, 1 H), 8.08 (t, 1 H,J = 
5.6 Hz), 8.0 (m, 5 H), 7.62 (s, 1 H), 7.53 (s, 1 H), 7.42 (s, 1 H), 7.23 (d, 1H, J= 1.2 Hz), 7.21 
(m, 2 H), 7.15 (m, 2 H), 7.13 (d, 1 H), 7.1 1 (m, 2 H), 7.04 (d, 1 H), 6.84 (m, 3 H), 3.98 (s, 3 H), 
3.97 (s, 3 H), 3.92 (s, 3 H), 3.82 (m, 6 H), 3.80 (s, 3 H), 3.77 (d, 6H), 3.35 (q, 2H,7= 5.8 Hz) 

25 3.0-3.3 (m, 8 H), 2.86 (q, 2H,;= 5.4 Hz), 2.66 (d, 3 H, J = 4.5 Hz), 2.31 (m, 4 H), 1.94 
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(quintet, 2 H, J - 6.2 Hz), 1.74 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1296.0 (1296.6 
calc. for C6OH78N23O11). 

ImImOpPy-y-ImPyPyPy-$-Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved 
5 in DMSO/NMP (1 mL) and DEEA (1 mL) by heating at 55 °C for 5 min. The dianhydride 
solution was added to ImImOpPy-y-ImPyPyPy-P-NH2 (13 mg, 10 jimol) dissolved in DMSO 
(750 |iL). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride 
hydrolyzed (0.1M NaOH, 3 mL, 55 °C, 10 min). Aqueous TFA (0.1% wt/v) was added to 
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to 
10 provide ImlmOpPy-y-ImPyPyPy-P-Dp-EDTA as a white powder upon lyophilization of the 
appropriate fractions (5.5 mg, 40% recovery). MALDI-TOF-MS (monoisotopic), 1570.9 
(1570.7 calc. for C70H92N25O18). 

ImImHpPy~Y-lmPyPyPy-$-Dp~EDTA. In order to remove the methoxy protecting group, 
15 a sample of ImlmOpPy-y-ImPyPyPy-p-Dp-EDTA (5 mg, 3.1 ^lmol) was treated with sodium 
thiophenoxide at 100 °C for 2 h. DMF (1000 jiL) and thiophenol (500 nL) were placed in a (13 
x 100 mm) disposable Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in 
mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride, 
ImlmOpPy-y-ImPyPyPy-P-Dp-EDTA (5 mg) dissolved in DMF (500 jiL) was added. The 
20 solution was agitated, and placed in a 100 °C heat block, and deprotected for 2 h. Upon 
completion of the reaction the culture tube was cooled to 0°C, and 7 ml of a 20 % (wt/v) 
solution of trifluoroacetic acid added. The aqueous layer is separated from the resulting biphasic 
solution and purified by reversed phase HPLC. ImlmHpPy-y-ImPyPyPy-p-Dp-EDTA is 
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72 % 
25 recovery). UV (H2O) ^max 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1556.6 
(1556.7 calc. for C69H90N25O18). 

ImImPyPy-y-ImOpPyPy-$-Dp-NH2- ImlmPyPy-y-ImOpPyPy-P-Pam-Resin was 
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-p-Pam- 

30 Resin (0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of 
polyamides containing imidazole and pyrrole amino acids, J. Am. Chem. Soc. 118, 6141-6146 
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was 
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine 
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation 

35 occurs. A sample of ImlmPyPy-y-ImOpPyPy-p-Pam-Resin (400 mg, 0.40 mmol/gram) was 
placed in a glass 20 mL peptide synthesis vessel and treated with neat 3,3 , -diamino-;V- 
methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction 
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting 
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solution purified by reversed phase HPLC. ImImPyPy-y-ImOpPyPy-p-Dp-NH2 is recovered 
upon lyophilization of the appropriate fractions as a white powder (104 mg, 54% recovery). UV 
(H2O) X max 246, 316 (66,000); MALDI-TOF-MS (monoisotopic), 1296.6 (1296.6 calc. for 
C60H78N23OH). 

ImlmPyPy-y-ImOpPyPy-fi-Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved 
in DMSO/NMP (1 mL) and DIEA (1 mL) by heating at 55 °C for 5 min. The dianhydride 
solution was added to ImImPyPy-y-ImOpPyPy-P-NH2 (13 mg, 10 nmol) dissolved in DMSO 
(750 )iL). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride 
hydrolyzed (0.1M NaOH, 3 mL, 55 °C, 10 min). Aqueous TFA (0.1% wt/v) was added to 
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to 
provide ImlmPyPy-y-ImOpPyPy-P-Dp-EDTA as a white powder upon lyophilization of the 
appropriate fractions (5.9 mg, 42% recovery). MALDI-TOF-MS (monoisotopic), 1570.8 
(1570.7 calc. for C70H92N25O18). 

ImlmPyPy-y'ImHpPyPy-fi'Dp-EDTA. In order to remove the methoxy protecting group, 
a sample of ImlmPyPy-y-ImOpPyPy-p-Dp-EDTA (5 mg, 3.1 jimol) was treated with sodium 
thiophenoxide at 100 °C for 2 h. DMF (1000 \iL) and thiophenol (500 |iL) were placed in a (13 
x 100 mm) disposable Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in 
mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride, 
ImlmPyPy-Y-ImOpPyPy-P-Dp-EDTA (5 mg) dissolved in DMF (500 nL) was added. The 
solution was agitated, and placed in a 100 °C heat block, and deprotected for 2 h. Upon 
completion of the reaction the culture tube was cooled to 0°C, and 7 ml of a 20 % (wt/v) 
solution of trifluoroacetic acid added. The aqueous layer is separated from the resulting biphasic 
solution and purified by reversed phase HPLC. ImlmPyPy-y-ImHpPyPy-P-Dp-EDTA is 
recovered as a white powder upon lyophilization of the appropriate fractions (3,2 mg, 72 % 
recovery). UV (H2O) X. max 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1555.9 
(1556.7 calc. for C69H90N25O18). 

EXAMPLE 6: 

DETERMINATION OF POLY AMIDE BINDING ORIENTATION 

Affinity cleavage experiments using hairpin polyamides modified with EDTA»Fe(II) at 
either the C-terminus or on the y-turn, were used to determine polyamide binding orientation 
and stoichiometry. The results of affinity cleavage experiments are consistent only with 
recognition of 6-bp by an 8-ring hairpin complex and rule out any extended 1:1 or overlapped 
complex formation. In addition, affinity cleavage experiments reveal hairpin formation 
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supporting the claim that it is the Hp/Py and Py/Hp pairing which form at both match and 
mismatch sites to discriminate A # T from T»A. 

Affinity cleavage reactions were executed in a total volume of 40 |iL. A stock solution of 
5 polyamide or H 2 0 was added to a solution containing labeled restriction fragment (20,000 
cpm), affording final solution conditions of 25 mM Tris-Acetate, 20 mM NaCl, 100 (j.M/bp calf 
thymus DNA, and pH 7.0. Solutions were incubated for a minimum of 4 hours at 22°C. 
Subsequently, 4 |iL of freshly prepared 100 |iM Fe(NH4)2(S0 4 )2 was added and the solution 
allowed to equilibrate for 20 min. Cleavage reactions were initiated by the addition of 4 fiL of 
10 100 mM dithiothreitol, allowed to proceed for 30 min at 22 °C, then stopped by the addition of 
10 jaL of a solution containing 1.5 M NaOAc (pH 5.5), 0.28 mg/mL glycogen, and 14 jaM base 
pairs calf thymus DNA, and ethanol precipitated. The reactions were resuspended in lx 
TBE/80% formamide loading buffer, denatured by heating at 85 °C for 15 min, and placed on 
ice. The reaction products were separated by electrophoresis on an 8% polyacrylamide gel 
15 (5% cross-link, 7 M urea) in lx TBE at 2000 V for 1.5 hours. Gels were dried and exposed to a 
storage phosphor screen. Relative cleavage intensities were determined by volume integration 
of individual cleavage bands using ImageQuant software. 
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EXAMPLE 7: 

IMPROVEMENT TO POLYAMIDE SEQUENCE SPECIFICITY. 

The polyamides of this invention provide improved specificity relative to existing 
polyamide technology. Turner, J. T., Baird, E. E., and Dervan, P.B. describe the recognition of 
seven base pair sequences in the minor groove of DNA by ten-ring pyrrole-imidazole 
polyamide hairpins J. Am. Chem. Soc. 1997 119, 7636. For example, quantitative DNasel 
footprint titrations reveal that the 10-ring hairpin ImPyPyPyPy-y-ImPyPyPyPy-P-Dp binds a 5'- 
TGTAACA-3- sequence with an equlibrium dissociation constant of 0.083 nM, and 18-fold 
specificity versus a single base mismatch site. A number of other sites are also bound on the 
252-bp DNA fragment used for the footprint titration experiments. (Figure 13). Introduction of 
a Hp/Py and Py/Hp pair in the 10-ring polyamide, IrnHpPyPyPy-y-ImHpPyPyPy-P-Dp, to 
recognize a T # A and A»T within the 7 -bp target sequence, increases the sequence-specificty. For 
example, a single base mismatch site 5'-TGGAACA-3 is discriminated by > 5000- fold (Figure 
13, Table 8). In fact all 245 7-bp mismatch sites present on the restriction fragment are 
discriminated > 5000-fold by the polyamide ImHpPyPyPy-y-ImHpPyPyPy-p-Dp (Figure 13). 
For cases where three A,T base pairs are present in succession it is preferred to substitute Py/Py 
in place of at least one Hp/Py or Py/Hp to provide for recognition of A*T and T»A at a single 
position. 



TABLE 8 Equilibrium dissociation constants* 

Polyamidet 5'-TGGTCA-3' 5'-TGGACA-3 f ^rel* 

5'-T G \t\ A [a1 C A-3* 5'-T G [g] T [51 C A-3' 

tOODCK •OOOCK 

3-A c[aJtItJg T-5* 3-A C[CJa[TJG T-5* 

Kg = 0.083 nM K d = 1.5 nM 

5'-T GfTlAfAlc A-3' 5'-T G fol T fxl C A-3' 

Hp/Py +XK>00®^ ^K>Oppte^ >5000 

3'-A CIaJtItJG T-5' 3*-A C [CJ A [Tj G T-5' 

K d = 0.2 nM K d > 1000 nM 

*The reported dissociation constants are the average values obtained from three 
DNase I footprint titration experiments. The standard deviation for each data set is 
less than 15% of the reported number. Assays were carried out in the presence of 10 
mM Tris-HCl 10 mM KC1, 10 mM MgClZ and 5 mM CaCL2 at pH 7.0 and 22 °C 
tRing pairing opposite T*A and A»T in the fourth position. 
iCalcuiated as K d (5'-TGGTACA-3')/K d (5'-TGTAACA-3'). 

EXAMPLE 8: 
USE OF PAIRING CODE 

There are 256 possible four base pair combinations of A, T, G, and C. Of these, there are 
a possible 240 four base pair sequences which contain at least 1 A«T or T*A base pair and 
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therefore can advantageously use an Hp/Py, or Py/Hp carboxamide binding. Polyamides 
binding to any of these sequences can be designed in accordance with the code of TABLE 2. 
Table 9 lists the sixteen eight-ring hairpin polyamides (1-16) which recognize the sixteen 5'- 
WGTNNW-3' sequences (W = A or T, X = A, G, C, or T). Table 10 lists the sixteen eight-ring 

5 hairpin polyamides (17-32) which recognize the sixteen 5'-WGANNW-3' sequences (17-32). 
Table 11 lists the twelve eight-ring hairpin polyamides (33-44) which recognize twelve 5'- 
WGGNNW-3' sequences which contain at least one A,T base pair. Table 1 1 lists the four eight- 
ring hairpin polyamides (G1-G4) which target the four S'-WGGNNWO' sequences (G1-G4) 
which contain exclusively G*C base pairs. Table 12 lists the twelve eight-ring hairpin 

10 polyamides (45-56) which recognize twelve 5'-WGCNNW-3' sequences which contain at least 
one A,T base pair. Table 12 lists the four eight-ring hairpin polyamides (G5-G8) which target 
the four 5'-WGCNNW-3 f sequences (G5-G8) which contain exclusively G»C base pairs. Table 
13 lists the sixteen eight-ring hairpin polyamides (57-72) which recognize the sixteen 5'- 
WTTNNW-3' sequences (57-72), Table 14 lists the sixteen eight-ring hairpin polyamides (73- 

15 88) which recognize the sixteen 5'-WTANNW-3' sequences (73-88). Table 15 lists the sixteen 
eight-ring hairpin polyamides (89-104) which recognize the sixteen 5*-WTGNNW-3' sequences 
(89-104). Table 16 lists the sixteen eight-ring hairpin polyamides (105-120) which recognize 
the sixteen 5 , -WTCNNW-3 > sequences (105-120). Table 17 lists the sixteen eight-ring hairpin 
polyamides (121-136) which recognize the sixteen 5'-WATNTsrW-3' sequences (121-136). 

20 Table 18 lists the sixteen eight-ring hairpin polyamides (137-152) which recognize the sixteen 
5'-WAANNW-3* sequences (137-152). Table 19 lists the sixteen eight-ring hairpin polyamides 
(153-168) which recognize the sixteen 5*-WAGNNW-3' sequences (153-168). Table 20 lists 
the sixteen eight-ring hairpin polyamides (169-184) which recognize the sixteen 5'-WACNNW- 
3' sequences (169-184). Table 21 lists the sixteen eight-ring hairpin polyamides (185-200) 

25 which recognize the sixteen S'-WCTNNWO' sequences (185-200). Table 22 lists the sixteen 
eight-ring hairpin polyamides (201-216) which recognize the sixteen 5'-WCANNW-3* 
sequences (201-216). Table 23 lists the twelve eight-ring hairpin polyamides (217-228) which 
recognize the twelve 5'-WCGNNW-3' sequences which contain at least one A,T base pair. 
Table 23 lists the four eight-ring hairpin polyamides (G9-G12) which target the four 5'- 

30 WCGNNW-3' sequences (G9-G12) which contain exclusively OG base pairs. Table 24 lists 
the twelve eight-ring hairpin polyamides (229-240) which recognize the twelve 5 T -WCCNNW- 
3' sequences which contain at least one A,T base pair. Table 24 lists the four eight-ring hairpin 
polyamides (G13-G16) which target the four 5'-WCCNNW-3' sequences (G13-G16) which 
contain exclusively OG base pairs. 
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TA BLE 9: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGTNNW-3* 
DNA sequence aromatic amino acid sequence 



1) 5 1 -W G T T T W-3 ' 1) ImHpHpHp -y- PyPyPyPy 

2) 5'-W G T T A W-3' 2 ) imHpHpPy-y-HpPyPyPy 

3) 5 • -W G T T G W-3 1 3) ImHpHpIm-y- PyPyPyPy 

4) 5'-W G T T C W-3' 4 ) ImHpHpPy-y-ImPyPyPy 

5) 5 1 -W G T A T W-3 1 5) ImHpPyHp-y-PyHpPyPy 

6) 5'-W G T A A W-3' 6 ) ImHpPyPy-y-HpHpPyPy 

7) 5»-W G T A G W-3' 7) imHpPylm-y-PyHpPyPy 

8) 5 1 -W G T A C W-3' 8 ) ImHpPyPy-y- ImHpPyPy 

9) 5 1 -W G T G T W-3 1 9) ImHpImHp-y-PyPyPyPy 

10) 5'-W G T G A W-3 1 10) ImHpImPy-y-HpPyPyPy 

11) 5 1 -W G T G G W-3" ll) ImHpImlm-y-PyPyPyPy 

12) 5'-W G T G C W-3 1 12 ) ImHpImPy-y- ImPyPyPy 

13) 5'-W G T C T W-3 1 13 ) ImHpPyHp-y-PylmPyPy 

14) 5 1 -W G T C A W- 3 1 14 ) ImHpPyPy-y-HpImPyPy 

15) 5 1 -W G T C G W-3' 15 ) ImHpPylm-y- PylmPyPy 

16) 5 1 -W G T C C W-3 1 16 ) ImHpPyPy-y- ImlmPyPy 
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TABLE 10: 8-ring Hairpin Poly amides for recognition of 6-bp 5'-WGANNW-3* 









DNA sequence 


aromatic amino acid sequence 


5 ™ 


17) 


5 ' 


-W G A T T 


W-3 1 


17) tmPyHpHp-y-PyPyHpPy 




18) 


5 * 


-W G A T A 


W-3 • 


18) ImPyHpPy-y-HpPyHpPy 




19) 


5 1 


-W G A T G 


W-3 • 


19) ImPyHpIm-y-PyPyHpPy 


10 














20) 


5' 


-W G A T C 


W-3 • 


20) ImPyHpPy-y- ImPyHpPy 




21) 


5' 


-W G A A T 


W-3 ' 


21) ImPyPyHp-y-PyHpHpPy 


15 


22) 


5' 


-W G A A A 


W-3 ' 


22) ImPyPyPy-y-HpHpHpPy 




23) 


5' 


-W G A A G 


W-3 • 


23) ImPyPylm-y-PyHpHpPy 


*- \j 




j 


-W G A A C 


W-3 1 


24) TmPvPvPv-v - TmHnHnPv 


25) 


5 


-W G A G T 


W-3 1 


25) ImPylmHp-y-PyPyHpPy 




26) 


5 


' -W GAGA 


W-3 » 


26) ImPylmPy-y-HpPyHpPy 






■j 


' -W G A G G 


W-3 • 


0 7 ) TmPvTmTm-v-PvPvHnPv 




28) 


5 


1 -W G A G C 


W-3 1 


28) ImPylmPy-y- ImPyHpPy 




29) 


5 


»-W G A C T 


W-3' 


29) ImPyPyHp-y-PylmHpPy 


30 














30) 


5 


• -W G A C A 


W-3 • 


30) ImPyPyPy-y-HpImHpPy 




31) 


5 


1 -W G A C G 


W-3 ' 


31) ImPyPylm-y-PylmHpPy 


35 


32) 


5 


' -W G A C C 


W-3 1 


32) ImPyPyPy-y- ImlmHpPy 
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TABLE 1 1 : 8-ring Hairpin Polyamides for recognition of 6-bp 5*-WGGNNW-3' 
DNA sequence aromatic amino acid sequence 



10 



15 



20 



25 



30 



35 



33) 5 

34) 5 

35) 5 

36) 5 

37) 5 

38) 5 

39) 5 

40) 5 

41) 5 

42) 5 

43) 5 

44) 5 
Gl) 5 
G2) 5 
G3) 5 
G4) 5 



-W G G T T W-3 
-W G G T A W-3 
-W G G T G W-3 
-W G G T C W-3 
-W G G A T W-3 
-W G G A A W-3 
-W G G A G W-3 
-W G G A C W-3 
-W G G G T W-3 
-W G G G A W-3 
-W G G C T W-3 
-W G G C A W-3 
-W G G G G W-3 
-W G G G C W-3 
-W G G C G W-3 
-W G G C C W-3 



33) ImlmHpHp -y - PyPy PyPy 

34) ImlmHpPy -y-HpPyPyPy 

35) ImlmHpIm-y-PyPyPyPy 

36) ImlmHpPy -y - ImPyPyPy 

37) ImlmPyHp-y-PyHpPyPy 

38) ImlmPyPy-y-HpHpPyPy 
3 9) ImlmPylm-y-PyHpPyPy 

40) ImlmPyPy-y- ImHpPyPy 

41) ImlmlmHp-y- PyPy PyPy 

42) ImlmlmPy-y-HpPyPyPy 

43) ImlmPyHp-y-PylmPyPy 

44) ImlmPyPy-y-HpImPyPy 
Gl) Imlmlmlm-y-PyPyPyPy 
G2) ImlmlmPy-y- ImPyPyPy 
G3 ) ImlmPylm-y- PylmPyPy 
G4) ImlmPyPy-y- ImlmPyPy 
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TABLE 12: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGCNNW-3' 
DNA sequence aromatic amino acid sequence 



10 



15 



20 



25 



30 



35 



45) 5 1 -W G C T T W-3 

46) 5 1 -W G C T A W-3 

47) 5 1 -W G C T G W-3 

48) 5 1 -W G C T C W-3 

49) 5 1 -W G C A T W-3 

50) 5 1 -W G C A A W-3 

51) 5 1 -W G C AG W-3 

52) 5 1 -W G C A C W-3 

53) 5 » -W G C G T W-3 

54) 5 1 -W G C G A W-3 

55) 5 1 -W G C C T W-3 

56) 5 1 -W G C C A W-3 
G5) 5 • -W G C G G W-3 
G6) 5 1 -W G C G C W-3 
G7) 5 1 -W G C C G W- 3 
G8) 5 ■ -W G C C C W-3 



45) ImPyHpHp -y- PyPylmPy 

46) I mPy Hp Py - y - Hp Py ImPy 

47) ImPyHpIm-y-PyPylmPy 
4 8 ) ImPyHpPy-y- imPylmPy 
4 9) ImPyPyHp-y-PyHpImPy 

50) ImPyPyPy-y-HpHpImPy 

51) ImPyPylm-y-PyHpImPy 

52) ImPyPyPy-y- ImHpImPy 

53) ImPylmHp-y-PyPylcnPy 

54) ImPylmPy-y-HpPylmPy 

55 ) ImPyPyHp-y-PylmlmPy 

56) ImPyPyPy-y-HpImlmPy 
G5) ImPylmlm-y-PyPylmPy 
G6) ImPylmPy-y-ImPylmPy 
G7) ImPyPylm-y-PylmlmPy 
G8) ImPyPyPy-y- ImlmlmPy 
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TABLE 13: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTTNNW-3' 



DNA sequence 



aromatic amino acid sequence 



57) 5 1 -W T T T T W-3 ■ 

58) 5 1 -W T T T A W-3 1 

59) 5 ' -W T T T G W-3 1 

60) 5 1 -W T T T C W-3 • 

61) 5 1 -W T T A T W-3 1 

62) 5 1 -W T T A A W- 3 ' 

63) 5 1 -W T T A G W-3 1 
.64) 5 1 -W T T A C W-3 • 

65) 5 1 -W T T G T W-3 1 

66) 5 1 -W T T G A W-3 ' 

67) 5 ■ -W T T G G W-3 ' 

68) 5 1 -W T T G C W-3 1 

69) 5 1 -W T T C T W-3 ' 

70) 5 ' -W T T C A W-3 ■ 

71) 5 1 -W T T C G W-3 1 

72) 5 1 -W T T C C W-3 ■ 



5 7 ) HpHpHpHp -y- PyPyPyPy 
5 8 ) HpHpHpPy - y - HpPyPyPy 

5 9 ) HpHpHpIm-y- PyPyPyPy 

60 ) HpHpHpPy-y- ImPyPyPy 

6 1 ) HpHp Py Hp - y - Py Hp PyPy 

62 ) HpHpPyPy-Y-HpHpPyPy 

6 3 ) HpHpPylm-y- Py Hp Py Py 
64 ) HpHpPyPy-y- ImHpPyPy 
6 5 ) HpHplmHp-y- Py Py Py Py 
6 6 ) HpHp ImPy - y - Hp Py Py Py 

67 ) HpHpImlm-y- PyPyPyPy 

68) HpHpImPy-y-ImPyPyPy 

6 9 ) HpHpPyHp -y- Py ImPyPy 

7 0 ) HpHpPyPy -y - Hp ImPy Py 
7 1 ) HpHpPy Im-y- Py ImPyPy 
12) HpHpPyPy-y- imlmPyPy 
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TABLE 14: 8-ring Hairpin Polyamides for recognition of 6-bp 5^WTANNW-3 f 



10 



15 



20 



25 



30 



DNA sequence 



35 







1 - w 

Ft 


T 
X 


A 


T 


T 


XaJ o 

W-3 


74) 


5 


1 -W 


T 


A 


T 


*\ 


Ft — J 


75) 


5 


1 -w 


T 


A 


T 


rj 


w - o 


76) 


5 


1 -w 


T 


A 


T 


P 
I* 


rf — O 


77) 


5 


-w 


T 


A 


A 


T 


W-3 


78) 


5 


-w 


T 


A 


A 


A 


W-3 


79) 


5 


-w 


T 


A 


A 


G 


W-3 


80) 


5 1 


-w 


T 


A 


A 


c 


W-3 


81) 


5 ' 


-w 


T 


A 


G 


T 


W- 3 


82) 


5 ■ 


-w 


T 


A 


G 


A 


W-3 


83) 


5 • 


-w 


T 


A 


G 


G 


W-3 


84) 


5 • 


-w 


T 


A 


6 


C 


W-3 


85) 


5 » 


-w 


T 


A 


C 


T 


W-3 


86) 


5 • 


-w 


T 


A 


C 


A 


W-3 


87) 


5 ' 


-w 


T 


A 


C 


G 


W-3 


88) 


5 • 


-w 


T 


A 


c 


C 


W-3 



aromatic amino acid sequence 



73) HpPyHpHp-y-PyPyHpPy 

74 ) HpPyHpPy-y-HpPyHpPy 
7 5 ) Hp Py Hp Im-y-PyPyHpPy 

76) HpPyHpPy-y- ImPyHpPy 

77) HpPyPyHp-y-PyHpHpPy 

78) HpPyPyPy-y-HpHpHpPy 

79) HpPyPylm-y-PyHpHpPy 

80) HpPyPyPy-y-imHpHpPy 

81) HpPylmHp-y-PyPyHpPy 

82) HpPylmPy-y-HpPyHpPy 

83) HpPylmlm-y-PyPyHpPy 

84) HpPylmPy-y- ImPyHpPy 

85) HpPyPyHp-y-PylmHpPy 

86) HpPyPyPy-y-HpImHpPy 

87) HpPyPylm-y-PylmHpPy 

88) HpPyPyPy-y- imimHpPy 
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T ABLE 15: 8-ring Hairpin Polyamides for recognition of 6-bp S'-WTGNNW-S' 
DNA sequence aromatic amino acid sequence 



89) 


5 ' 


-W T G T T W-3' 


8 9) Hp ImHpHp - y - PyPyPyPy 


90) 


5« 


-W T G T A W-3' 


90) HpImHpPy-y-HpPyPyPy 


91) 


5' 


-W T G T G W-3 ' 


91) Hp ImHp Im-y-PyPyPyPy 


92) 


5 ■ 


-W T G T C W-3' 


92) HpImHpPy-y-lmPyPyPy 


93) 


5 " 


-W T G A T W-3 1 


93) Hp I mPy Hp - y - Py Hp Py Py 


94) 


5' 


-W T G A A W-3 1 


94) Hp ImPy Py - y - HpHp Py Py 


95) 


5 1 


-W T G A G W-3' 


95) HpImPylm-y-PyHpPyPy 


96) 


5 ' 


-W T G A C W-3 • 


96) HpImPyPy-y-ImHpPyPy 


97) 


5' 


-W T G G T W-3 1 


97) HpImlmHp-y-PyPyPyPy 


98) 


5 ' 


-W T G G A W-3 1 


98) HpImlmPy-y-HpPyPyPy 


99) 


5' 


1 -W T G C T W-3 1 


9 9 ) Hp ImPyHp - y - Py I mPy Py 


100) 


5 


' -W T G C A W-3 1 


10 0) HpImPy Py - y - Hp ImPy Py 


101) 


5 


' -W T G G G W-3 1 


101) HpImlmlm-y-PyPyPyPy 


102) 


5 


• -W T G G C W-3 ' 


102 ) HpImlmPy-y- ImPyPyPy 


103) 


5 


1 -W T G C G W-3 1 


103 } HpImPylm-y-PylmPyPy 


104) 


5 


1 -W T G C C W- 3 1 


104 ) HpImPyPy-y- ImlmPyPy 
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TABLE 16: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTCNNW-3' 



10 



15 



20 



25 



30 



DNA sequence 



35 



105) 


5' 


-W 


T 


C 


T 


T 


W-3 


106) 


5 1 


-W 


T 


c 


T 


A 


W-3 


107) 


5' 


-W 


T 


c 


T 


G 


W-3 


108) 


5' 


-w 


T 


c 


T 


C 


W-3 


109) 


5' 


-w 


T 


c 


A 


T 


W-3 


110) 


5' 


-w 


T 


c 


A 


A 


W-3 


111) 


5' 


-w 


T 


c 


A 


G 


W-3 


112) 


5' 


-w 


T 


c 


A 


C 


W-3 


113) 


5' 


-w 


T 


c 


G 


T 


W-3 


114) 


5* 


-w 


T 


c 


G 


A 


W-3 


115) 


5' 


-w 


T 


c 


C 


T 


W-3 


116) 


5» 


-w 


T 


c 


C 


A 


W-3 


117) 


5' 


-w 


T 


c 


G 


G 


W-3 


118) 


5' 


-w 


T 


c 


G 


C 


W-3 


119) 


5' 


-w 


T 


c 


C 


G 


W-3 


120) 


5' 


-w 


T 


c 


C 


C 


W-3 



aromatic amino acid sequence 

105) HpPyHpHp-y-PyPylmPy 



1 0 6 ) Hp Py HpPy -y - Hp Py ImPy 
107) HpPyHpIm-y-PyPylmPy 
1 0 8 ) Hp Py Hp Py - y - 1 mPy Im Py 
10 9) HpPy PyHp -y - PyHp ImPy 

110) HpPyPy Py -y - HpHpImPy 

111) HpPy Pylm-y-PyHpImPy 

112) HpPy PyPy-y-lmHpImPy 
113 > HpPy ImHp-y-PyPy ImPy 

114) HpPylmPy-y-HpPylmPy 

115) HpPyPyHp-y-PylmlmPy 

116) HpPyPyPy-y-HpImlmPy 

117) HpPylmlm-y-PyPy ImPy 

118) HpPyimPy-y- ImPy ImPy 

119) HpPyPylm-y- PylmlmPy 

120) HpPyPyPy-y- ImlmlmPy 
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TABLE 17: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WATNNW-3* 



DNA sequence 



10 



15 



20 



25 



30 



aromatic amino acid sequence 



35 



121) 5 

122) 5 

123) 5 

124) 5 

125) 5 

126) 5 

127) 5 

128) 5 

129) 5 

130) 5 

131) 5 

132) 5 

133) 5 

134) 5 

135) 5 

136) 5 



-W A T T T W-3 
-W A T T A W-3 
-W A T T G W-3 
-WATT C W-3 
-W A T A T W-3 
-W A T A A W-3 
-W A T A G W-3 
-W A T A C W-3 
-W A T G T W-3 
-W A T G A W-3 
-W A T G G W-3 
-W A T G C W-3 
-W A T C T W-3 
-W A T C A W-3 
-W A T C G W-3 
-W A T C C W-3 



121) PyHpHpHp - y - Py Py PyHp 

122) PyHpHpPy-y-HpPyPyHp 

123) PyHpHpIm-y-PyPyPyHp 

124) PyHpHpPy-y-imPyPyHp 

125) PyHpPyHp-y-PyHpPyHp 

126) PyHp PyPy-y-HpHp PyHp 

127) PyHpPylm-y-PyHpPyHp 
12 8) PyHpPyPy-y-lmHpPyHp 

12 9) PyHp ImHp - y - Py Py PyHp 

130) PyHpImPy-y-HpPyPyHp 

131) PyHpImlm-y-PyPyPyHp 

13 2) PyHpImPy-y-ImPyPyHp 
13 3) PyHp PyHp-y-PylmPyHp 
134) PyHpPyPy-y-HpImPyHp 
13 5) PyHpPylm-y-PylmPyHp 
13 6 ) PyHpPyPy-y- ImlmPyHp 
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TABLE 18: 



8-ring Hairpin Polyamides for recognition of 6-bp S'-WAANNW-3* 



10 



20 



30 



DNA sequence 



/ ) 


c 
O 


1 - w 


A 


A 


T 


T 


W-3 


13 8 ) 


q 


mi 


A 




rri 

T 


A 


W-3 


139) 


~j 


i w w 
rv 


* 
A 


A 


T 


G 


W-3 


140) 


5 


ft 


A 


A 


rri 

T 


C 


W-3 


141) 


5 




A 


A 


A 
A 




W - J 


142) 


5 


1 - w 


A 


21 
x\ 




A 


w - J 


143) 


5 


-W 


A 






/"2 
Vj 


w 1 


144) 


5 1 


-w 


A 


A 






W- J 


145) 


5 1 


-W 


A 


a 

tx 


r: 

Sj 


T 


W - J 


146) 


5 • 


- w 


A 




cz 

VJ 


A 


ivr ^ | 
W — J 


147) 


5 1 


-w 


A 


A 


G 


G 


W-3 ' 


148) 


5 ■ 


-w 


A 


A 


G 


C 


W-3 ' 


149) 


5 ■ 


-w 


A 


A 


C 


T 


W-3 ■ 


150) 


5 ' 


-w 


A 


A 


C 


A 


W-3 1 


151) 


5 ' 


-w 


A 


A 


C 


G 


W-3 ' 


152) 


5 ' 


-w 


A 


A 


C 


C 


W-3 • 



aromatic amino acid sequenc e 

13 7) PyPyHpHp-y-PyPyHpHp 
13 8) PyPyHpPy-y-HpPyHpHp 

13 9) PyPyHpIm-y- PyPyHpHp 

14 0) PyPyHpPy-y-l m pyHpHp 

141) PyPyPyHp-y-PyHpHpHp 

142) PyPyPyPy-y-HpHpHpHp 
14 3) PyPyPylm-y-PyHpHpHp 
14 4 ) Pypypypy-y. imHpHpHp 
14 5) PyPylmHp-y-PyPyHpHp 
14 6) PyPylmPy-y-HpPyHpHp 
14 7) PyPylmlm-y-PyPyHpHp 
14 8 ) PyPylmPy-y- ImPyHpHp 

149) PyPyPyHp-y-PylmHpHp 

150) PyPyPyPy-y-KpImHpHp 

151) PyPyPylm-y-PylmHpHp 

152 ) PyPyPyPy-y- ImlmHpHp 
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TABLE 19: 8-ring Hairpin Polyamides for recognition of 6-bp S'-WAGNN WO* 
DNA sequence aromatic amino acid sequence 



5 


153) 


5 


1 -W 


A 


G 


T 


T 


W-3 ' 


153) PylmHpHp-y-PyPyPyHp 




154) 


5 


1 -W 


A 


G 


T 


A 


W-3 ' 


154) PylmHpPy-y-HpPyPyHp 


10 


155) 


5 


1 -w 


A 


G 


T 


G 


W-3 ' 


155) PylmHpIm-y-PyPy PyHp 


156) 


5 


' -w 


A 


G 


T 


C 


W-3 ' 


156) PylmHpPy-y- ImPyPyHp 




157) 


5 


■ -w 


A 


G 


A 


T 


W-3 ' 


157) PylmPyHp-y-PyHpPyHp 


15 


158) 


5 


• -w 


A 


G 


A 


A 


W-3 • 


15 8) PylmPyPy-y-HpHpPyHp 




159) 


5 


' -w 


A 


G 


A 


G 


W-3 • 


159) PylmPylm-y-PyHpPyHp 


20 


160 ) 


5 


1 -w 


A 


G 


A 


C 


W-3 • 


160) PylmPyPy-y-ImHpPyHp 




161) 


5 


' -w 


A 


G 


G 


T 


W-3 » 


161) PylmlmHp-y-PyPyPyHp 




162) 


5 


' -w 


A 


G 


G 


A 


W-3 ' 


162) PylmlmPy-y-HpPyPyHp 


25 


163) 


5 


' -w 


A 


G 


C 


T 


W-3 « 


163) PylmPyHp-y-PylmPyHp 




164) 


5' 


-W A < 


3 < 


2 A W-3 ' 


164) PylmPyPy-y-HpImPyHp 


30 


165) 


5' 


-W A ( 


3 < 


3 < 


3 W-3 ' 


165) Pylmlmlm-y-PyPyPyHp 




166) 


5 ' 


-W A < 


3 


3 ( 


C W-3 ' 


166) PylmlmPy-y-ImPyPyHp 




167) 


5' 


-W A 


3 


C ( 


3 W-3 1 


167) PylmPylm-y-PylmPyHp 


35 


168) 


5 ' 


-W A i 


3 


c < 


C W-3 ' 


168) PylmPyPy-y- ImlmPyHp 
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TABLE 20: 8-ring Hairpin Polyamides for recognition of 6-bp S'-WACNNW-3* 
DNA sequence aromatic amino acid sequence 



5 


169) 


5 ' 


-W 


A 


C 


T 


T 


W-3 ' 


16 9) PyPyHpHp-y- 


PyPylmHp 




170) 


5 ' 


-W 


A 


C 


T 


A 


W-3 » 


170) PyPyHpPy-y- 


HpPylmHp 


10 


171) 


5 1 


-W 


A 


C 


T 


G 


W-3 « 


171) PyPyHpIm-y- 


PyPylmHp 




172) 


5» 


-W 


A 


C 


T 


C 


W-3 1 


172) PyPyHpPy-y- 


ImPylmHp 




173) 


5 * 


-W 


A 


C 


A 


T 


W-3' 


17 3) PyPyPyHp-y- 


PyHpImHp 


15 


174) 


5' 


-W 


A 


C 


A 


A 


W-3' 


174) PyPyPyPy-y- 


HpHpImHp 




175) 


5 » 


-w 


A 


C 


A 


G 


W-3 ■ 


175) PyPyPylm-y- 


PyHpImHp 


20 


176) 


5 1 


-w 


A 


C 


A 


C 


W-3' 


176) PyPyPyPy-y- 


ImHpImHp 




177) 


5 ' 


-w 


A 


C 


G 


T 


W-3 ' 


177) PyPylmHp-y- 


- PyPylmHp 




178) 


5' 


-w 


A 


C 


G 


A 


W-3 ' 


178) PyPylmPy-y- 


-HpPylmHp 


25 


179) 


5« 


-w 


A 


C 


C 


T 


W-3' 


179) PyPyPyHp-y- 


-PylmlmHp 




180) 


5 


-w 


A 


c 


C 


A 


W-3' 


18 0) PyPyPyPy-y 


-HpImlmHp 


30 


181) 


5 


-w 


A 


c 


G 


G 


W-3 » 


181) PyPylmlm-y 


-PyPylmHp 




182) 


5 


» -w 


A 


c 


G 


C 


W-3 1 


182 ) PyPylmPy-y 


- ImPylmHp 




183) 


5 


' -w 


A 


c 


C 


G 


W-3 1 


183 ) PyPyPylm-y 


- PylmlmHp 


35 


184) 


5 


' -w 


A 


c 


C 


C 


W-3 • 


184 ) PyPyPyPy-y 


- ImlmlmHp 
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TABLE 21: 8-ring Hairpin Polyamides for recognition of 6-bp S'-WCTNNW-S' 





DNA sequence 


aromatic amino acid sequence 


185) 


5' 


-W C T T T W-3 1 


18 5) PvHdHdHd-v - PvPvPvIm 


loo; 


5 ' 


-W C T T A W-3 1 


186) PyHpHpPy-y-HpPyPylm 


IB / ) 


5 ' 


-W C T T G W-3 1 


187) PyHpHpIm-y-PyPyPylm 


1 OQ\ 
XOOf 


5 1 


-W C T T C W-3 1 


186) PyHpHpPy-y-ImPyPylm 


1 O Q \ 


5* 


-W C T A T W-3 1 


189) PyHpPyHp-y-PyHpPylm 


1 QA\ 

xyv ) 


5' 


-W C T A A W-3 ' 


190) PyHpPyPy-y-HpHpPylm 


191) 


5' 


-W C T A G W-3 * 


191) PyHpPylm-y-PyHpPylm 


17Z ) 


5 ' 


-W C T A C W-3 ■ 


192) PyHpPyPy-y- ImHpPylm 


19 J J 


5 ■ 


-W C T G T W-3 1 


193) PyHpImHp-y-PyPyPylm 


194 ) 


5' 


' -W C T G A W-3 1 


194) PyHpImPy-y-HpPyPylm 


195) 


5' 


' -W C T G G W-3 ' 


195) PyHpImlm-y-PyPyPylm 


196) 


5 


' -W C T G C W-3 1 


196) PyHpImPy-y-ImPyPylm 


197) 


5 


• -W C T C T W-3 1 


197) PyHpPyHp-y-PylmPylm 


198) 


5 


' -W C T C A W-3 1 


198) PyHpPyPy-y-HpImPylm 


199) 


5 


1 -W C T C G W-3 1 


199) PyHpPylm-y-PylmPylm 


200) 


5 


' -W C T C C W-3 ' 


200) PyHpPyPy-y-ImlmPylm 
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TABLE 22: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCANNW-3' 











DNA 


. sequence 


aromatic amino acid sequence 


5 




201) 


5 


-w 


c 


A 


T 


T 


W-3 * 


201) Py Py Hp Hp - y - Py Py Hp I m 






202) 


5 1 


-w 


c 


A 


T 


A 


W-3 ■ 


202) PyPyHpPy-y-HpPyHpIm 


10 




203) 


5 


-w 


c 


A 


T 


G 


W-3 • 


203) PyPyHpIm-y-PyPyHpIm 






204) 


5 


-w 


c 


A 


T 


C 


W-3 • 


204) PyPyHpPy-y-lmPyHpIm 






205) 


5 ' 


-w 


c 


A 


A 


T 


W-3 ' 


205) PyPyPyHp-y-PyHpHpIm 


15 




206) 


5 ' 


-w 


c 


A 


A 


A 


W-3 * 


206) PyPyPyPy-y-HpHpHpIm 






207) 


5 ■ 


-w 


c 


A 


A 


G 


W-3» 


207) PyPyPylm-y-PyHpHpIm 


JL\> 




208) 


5 ' 


-w 


c 


A 


A 


C 


W-3 ' 


208) PyPyPyPy-y-ImHpHpIm 






209) 


5' 


-w 


c 


A 


G 


T 


W-3 * 


209) PyPylmHp-y-PyPyHpIm 






210) 


5 ' 


-w 


c 


A 


G 


A 


W-3 1 


210) PyPylmPy-y-HpPyHpIm 


25 




211) 


5 ' 


-w 


c 


A 


G 


G 


W-3 1 


211) PyPylmlm-y- PyPyHpIm 






212) 


5« 


-w 


c 


A 


G 


C 


W-3 ' 


212) PyPylmPy-y- ImPyHpIm 


30 




213) 


5' 


-w 


c 


A 


C 


T 


W-3 ' 


213) PyPyPyHp-y-PylmHpIm 






214) 


5' 


-w 


c 


A 


C 


A 


W-3 1 


214 ) PyPyPyPy-y-HpImHpIm 






215) 


5' 


-w 


c 


A 


C 


G 


W-3 ' 


215) PyPyPylm-y-PylmHpIm 


35 




216) 


5' 


-w 


c 


A 


C 


C 


W-3 1 


216) PyPyPyPy-y-ImlmHpIm 



B1MSDOCID: <WO 9837066A1 JA> 



48 



WO 98/37066 



PCT/US98/01006 



T ABLE 23: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCGNNW-3' 
DNA sequence aromatic amino acid sequence 
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TABLE 24: 8-ring Hairpin Polyamides for recognition of 6-bp S'-WCCNNW^ 
DNA sequence aromatic amino acid sequence 
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EXAMPLE 9: 

Aliphatic/Aromatic amino acid pairing for recognition of the DNA minor groove. 

40 

Selective placement of an aliphatic (3-alanine (P) residue paired side-by-side with either 
a pyrrole (Py) or imidazole (Im) aromatic amino acid is found to compensate for sequence 
composition effects for recognition of the minor groove of DNA by hairpin pyrrole-imidazoie 
polyamides. A series of polyamides were prepared which contain pyrrole and imidazole 
45 aromatic amino acids, as well as 7-aminobutyric acid (y) "turn" and P-alanine "spring" aliphatic 
amino acid residues. The binding affinities and specificities of these polyamides are regulated 
by the placement of paired p/p Py/p and lm/p residues. Quantitative footprint titrations 
demonstrate that replacing two Py/Py pairings in a 12-ring hairpin (6-y-6) with two Py/p 
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pairings affords 10- fold enhanced affinity and similar sequence specificity for an 8-bp target 
sequence. 

Table 25 Equilibrium association constants ( M* * ) for polyamides. ac 



Polyamidc 5" 


-TGTTAACA-3' 


5'-TGTGAACA-3* 


Specificity^ 


»-c>CK>Ch>C^ 


2.5 k 10 9 


3.9 x 10 s 


6 


9-000000^ 


1 .3 x 10 9 


2.0 x 10 s 


7 


•-OOOO-On 


1.7 x 10 10 


Z. / X 1U 


5 


♦OKKKX 


Ux 10' 1 


2.2 x I0 9 


55 




6.6 x 10* 


2.5 x 10 s 


26 




4.5 x 10 t0 


7.7 x 10 9 


6 


•0-<>000\ 


2.7 x 10 10 


5.7 x 10 9 


5 


•o-o-xxx 


s 1 x 10 8 


s 1 x 10* 


1 



a Values reported are the mean values obtained from three DNase I 
footprint titration experiments. *The assays were carried out at 22 °C at 
pH 7.0 in the presence of 10 mM Tris-HCl, 10 mM KCl. 10 mM MgCl 2 , 
and 5 mM CaCK. c Match site association constants and specificities 
higher than the parent hairpin are shown in boldtype. ^Specificity is 
calculated as K. a ( match) / K a (mismatch). 

5 

The 6-y-6 hairpin ImPylmPyPyPy-Y-ImPyPyPyPyPy-P-Dp, which contains six 
consecutive amino acid pairings, is unable to discriminate a single-base-pair mismatch site 5'- 
TGTTAACA-3' from a 5 ' -TGTG AAC A-3 ' match site. The hairpin polyamide Im-p- 
ImPyPyPy-Y-ImPyPyPy-p-Py-P-Dp binds to the 8-bp match sequence 5 ' -TGTG AAC A-3 1 with 
10 an equilibrium association constant of K a = 2.4 x 10 10 M" 1 and > 48-fold specificity versus the 
5' -TGTTAACA-3' single-base-pair mismatch site. 



B NSOO CID- <WO 9837066A 1 J A> 



WO 98/37066 



PCT/US98/01006 



Table 26 Equilibrium association constants (M*h for polyamides. °- c 



Polyamidc 


5'-TGTTAACA-3' 


S'-TQTGAACA-y 


Specificity^ 




2.5 x 10 9 


3-9 x 10 8 


6 




6.6 x 10* 


Z.5 x 10 8 


26 


0O+OOO 


5x 10 9 


5x tO 9 


I 




s5x 10 8 


2.4 x 10 10 


>48 



d Values reported for 1. 5, and 10 are the mean values obtained from 
three DNase 1 footprint titration experiments. 6 The assays were carried 
out at 22 °C at pH 7.0 in the presence of 10 raM Tris-HCl, 10 mM KC1, 
10 mM MgCl 2 , and 5 mM CaCIj. ' Match site association constants 
and specificities higher than parent hairpins are shown in 
boldtype. ^Specificity is calculated as ^(match] / K a ( mismatch). 



Modeling indicates that the P-alanine residue relaxes ligand curvature, providing for 
optimal hydrogen bond formation between the floor of the minor groove and both Im-residues 
within the Im-P-Im polyamide subunit. This observation provided the basis for design of a 
hairpin polyamide, Im-p-ImPy-y-Lm-p-ImPy-P-Dp, which incorporates Im/p pairings to 
recognize a "problematic" 5'-GCGC-3' sequence at subnanomolar concentrations. 

Table 27 Equilibrium association constarnts (M**) for poiyamides. a " fr 



Polyamide 5'-TGCGCA-3' 5V-TGGCCA-3* 5'-TGGGGA-3' 

D C QtOI^ 3.7 xlO 9 1.4x10* 1.1 xlO 8 

a Values reponed are the mean values obtained from a minimum of three 
DNase 1 footprint titration experiments. * b The assays were carried out at 
22 °C at pH 7.0 in the presence of 10 mMi Tris-HCl, 10 mM KC1, 10 mM 
MgCl 2 , and 5 mM CaCU. 

10 These results identify Im/p and p/Im pairings that respectively discriminate G*C and 

C'G from A«T/T # A as well as Py/p and p/Py pairings that discriminate A # T/T # A from 
G*C/OG. These aliphatic/aromatic amino acid pairings will facilitate the design of hairpin 
polyamides which recognize both a larger binding site size as well as a more diverse sequence 
repertoire. 



15 



EXAMPLE 10: 
POLYAMIDE BIOTIN CONJUGATES 

Bifunctional conjugates prepared between sequence specific DNA binding polyamides 
and biotin are useful for a variety of applications. First, such compounds can be readily attached 
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to a variety of matrices through the strong interaction of biotin with the protein streptavidin. 
Readily available strepdavidin-derivatized matrices include magnetic beads for separations as 
well as resins for chromatography. 

A number of such polyamide-biotin conjugates have been synthesized by solid phase 
synthetic methods outlined in detail above. Following resin cleavage with a variety of diamines, 
the polyamides were reacted with various biotin carboxylic acid derivatives to yield 
Afunctional conjugates. The bifunctional conjugates were purified by HPLC and characterized 
by MALDI-TOF mass spectroscopy and 'H NMR. 



The scheme for the synthesis of an exemplary biotin-polyamide conjugate is shown 



below. 



o 

x 

HN NH 



Resin 



-p-Py-Py-Py-lm-Y-Py-Py-Py-lrm 



H 2 N 



V NH 2 




DMF, D1EA I2h RT 
0 




° M 



V 7T N >-\ h 



N 

\ O 




The foregoing is intended to be illustrative of the present invention, but not limiting. 
Numerous variations and modifications of the present invention may be effected without 
departing from the true spirit and scope of the invention. 
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What is claimed is: 

1 . In a polyamide having at least three consecutive carboxamide pairs for 
binding to at least three DNA base pairs in the minor groove of a duplex 
DNA sequence having at least one A*T or T»A DNA base pair, the 
improvement comprising selecting a Hp/Py carboxamide pair to 
correspond to a T>A base pair in the minor groove of the duplex DNA 
sequence or selecting a Py/Hp carboxamide pair to bind to an A«T DNA 
base pair in the minor groove of the duplex DNA sequence. 

2. The polyamide of claim I wherein at least four consecutive carboxamide 
pairs bind to at least four DNA base pairs. 

3. The polyamide of claim 1 wherein at least five consecutive carboxamide 
15 pairs bind to at least five DNA base pairs. 

4. The polyamide of claim 1 wherein at least six consecutive carboxamide 
pairs bind to at least six DNA base pairs. 

20 5. The polyamide of claim 1 wherein the A»T or T»A base pair has a G»C 

or OG base pair on either side. 

6. The polyamide of claim 1 wherein the duplex DNA sequence is a 
regulatory sequence. 

25 

7. The polyamide of claim 1 wherein the duplex DNA sequence is a 
promoter sequence. 

8. The polyamide of claim 1 wherein the duplex DNA sequence is a coding 
30 sequence. 

9. The polyamide of claim 1 wherein the duplex DNA sequence is a non- 
coding sequence. 

35 10. The polyamide of claim 1 wherein the binding of the carboxamide pairs 

to the DNA base pairs modulates the expression of a gene. 
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11. A composition comprising an effective amount of the polyamide of claim 
1 and a pharmologically suitable excipient. 

12. A diagnostic kit comprising the polyamide of claim 1 . 

13. A polyamide according to claim 1 having the formula: 

X1X2X3X4-Y-X5X6X7X8 
wherein y is -NH-CH2-CH2-CH2-CONH- hairpin linkage derived from 
y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X4/X5, X3/X6, X2/X7, and X\/Xs represent 
carboxamide binding pairs which bind the DNA base pairs wherein at 
least one binding pair is Hp/Py or Py/Hp and the other binding pairs are 
selected from Py/Im Im/Py to correspond to the DNA base pair in the 
minor groove to be bound. 

14. The polyamide of claim 13 wherein there is at least one (3-alanine in a 
non- Hp containing binding pair. 

15. The polyamide of claim 13 wherein dimethylaminopropylamide is 
covalently bound to Xi or X8. 

16. A polyamide selected from those listed in Tables 9-24 as 
compounds 1 through 240. 

17. A polyamide selected from shown in Fig. 4. 
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